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TITLE OF THK TNVENTION 

METHODS FOR PROPAGATING ADENOVIRUS AND VIRUS PRODUCED THEREBY 

CROSS-REFERENCE TO RELATED APPLICATIONS 
5 The present application claims the benefit of application serial nos. 60/458,825, filed 

March 28, 2003; 60/455,312, filed March 17, 2003; 60/455,234, filed March 17, 2003; and 
60/405,182, filed August 22, 2002, 

FTFT.n OF THE INVENTION 

1 0 The present invention concerns various methods to propagate and rescue multiple 

serotypes of replication-defective adenovirus in a single adenoviral El -complementing cell line. 
TypicaUy, replication-defective adenovirus vectors propagate only in cell lines which express El 
proteins of the same serotype or subgroup as the vector. The methods disclosed herein ofifer the 
ability to propagate vectors derived firom multiple serotypes in a single cell line expressing El 

15 proteins fiom a single serotype. Such propagation of a wide range of vectors in orie cell Ime is 
accomplished by providing all or a portion of an E4 region zn cis within tiie genome of the 
replication-defective adenovirus. The added E4 region or portion thereof is cloned firom a virus 
of the same or highly similar serotype as that of the El gene product(s) of the complementing 
cell line. Interaction between the El gene products of the cell line and the heterologous E4 gene 

20 products of the replication-defective adenoviral vector enables the propagation and rescue of the 
recombinant replication-defective adenovirus vectors. The invention, therefore, bypasses an 
existing need in the art to customize complementing cell lines to the specific serotype or 
subgroup of the adenoviral vector being propagated or, alternatively, to have to transfect a cell 
line with an E4 region and then regulate the expression in trans of tiie E4 region within the El 

25 complementing cellline. 

PI ACKGRQUND OF THE INVENTION 

Beginning with the first human adenoviruses (Ads) isolated over four decades ago 
(Rowe et aL, Proc. Soc. Ejq>, Biol Med. 84:510-51% 1953), over 100 distinct serotypes of 

30 adenovirus have been isolated which infect various mammalian species, 5 1 of which are of 
human origin (Straus, Adenovirus infections in humans. In Hie Adenoviruses. 451-498, 1984; 
Hierholzer a/., Infect Dis., 158: 804-813, 1988; Schnuir and Dondero, Intervirology.. 36: 
79-83, 1993; Jong et al, J Clin Microbiol, 57:3940-3945:1999). The human serotypes have 
been categorised into six subgenera (A-F) based on a number of biological, chemical, 

35 immunological and structural criteria; criteria which include hemagglutination properties of rat 
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and rhesus monkey erythrocytes, DNA homology, restriction enzyme cleavage patterns, 
percentage of CHC content and oncogenicity (Straus, Adenovirus infections in humans, hi The 
Adenoviruses. 451-498, 1984; Horwitz, Adenoviridae and Iheir replication. In Virology: 1679- 
172, 1990). 

5 Deletion of an essential El region common to Ac various adeiwvirus serotypes 

has enabled the use of adenovirus vectors as gene transfer vectors for vaccine and gene therapy 
purposes. Resultant replication-defective vectors are propagated in cell lines lhat provide tiie 
deleted El gene products in trans. Supplanentation of the essential El gene products in treats in 
this manner woiks well when the El gene products axe from the same or a highly similar 

10 serotype. As such, El-deleted group C serotypes (Adl, Ad2, Ad5 and Ad6) grow weU in 293 or 
PER.C6 cells which contain and eacpress the Ad5 El region. In contrast. El -deleted serotypes 
other than group C, for example those from subgroups A, B, D, E, and F (e.g.. Ad3, Ad4, and 
Ad7 to Ad5 1), do not replicate efficiently in 293 or PER.C6 cells. The Ad5 El sequences in 293 
and PER.C6 cells do not ftilly complement the rq)lication of Ihese altranative serotypes. This 

1 5 presents a challenge due to the fact that the most characterized and studied complementing cell 
lines available for growth and propagation of adenovirus are based on El sequence from 

adenovirus serotype 5. 

This inability to fully complement the replication of serotypes other than group C 
adenovirus in Ad5 El complementing cell lines has been attributed to Ae inability of Ad5 (group 
20 C) Elb 55K gene product to functionally interact with the E4 gene products of non-group C 
serotypes. While the interaction is conserved within members of the same subgroup, it is not 
well conserved between subgroups. 

Hence, cell lines expressing both Ad5 El and 0RF6 ware graierated and proved 
useful in coraplemraiting alternative adenovirus serotypes; see, e.g., Abrahamsen et al, 1997 J. 
25 Virol. 8946-895 1 . Such incorporation of E4 (or 0RF6) into Ad 5 complementing cell lines as 
was done in Abrahamsen e/ aZ., 5upra, is known. 

U.S. Patent No. 5,849,561 discloses complementation of an El -deleted non-group 
C adenovirus vector in an Ad5-El complementing cell line which also expresses portions of the 
Ad5-E4gene. 

3Q U.S. Pateaat No. 6,127,175, issued to Vigne, et al., discloses a stably transfected 

ma mmalian cell line which e3q)resses a portion of the B4 region of adenovirus, preferably 0RF6 
or ORF6/7. Such a cell line is useful for complemaitation of recombinant Ad genomes deficient 
in the E4 region. 

European Application EP 1 054 064 Al discloses recombinant, rq)Ucation 
35 deficient adenovirus 35 (Ad35) vectors and cell lines which complement in trans the growth of 
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these vectors. A cefl line which expresses AdSElA and E2A genes (PER.C6) was shown to 
complemeait an Ad35-El deleted vector upon co-ej^ression of Ad35-EIB proteins. 

U.S. Patent No. 6,270,996, issued to Wilson, et al.y discloses E1/E4 deleted 
adenovirus vectors and El/E4(ORF6) cell lines which complement in trans virus growth without 

S resulting in cell toxicity. 

U.S. Patent No. 6^202,060, issued to Mehtali, e/ al., discloses adenoviral vectors 
wherein portions of the early genes are under control of an inducible promoter. The '060 patent 
also discloses complementing cell lines which may be used in tandem with these Ad vectors. 

The generation of seiotype-specific cell lines providing a con5)lementing 

10 serotype-specific El gene product(8) in is known as well 

Although Ad5-based vectors have been used extensively in a mrniber of gene 
therapy trials, there may be limitations on the use of Ad5 and other group C adenoviral vectors 
due to preexisting immunity in die general population due to natural infection. Ad5 and other 
gtoiq) C members tend to be among the most seroprevalent serotypes. Immunity to existing 

15 vectors may develop as a result of exposure to the vector during treatment These types of 
preexistii^ or developed immunity to seroprevalent gene delivery vectors may limit Ihe 
effectiveness of gme therapy or vaccination efforts. Alternative adenovirus serotypes, thus, 
constitute very important targets m Ihe pursuit of gene deUvery systems capable of evading the 
host immune response. 

20 There ronains both a practical and commercial need for an adenovirus-based 

vaccine and/or gene therapy delivery system which allows for the production of multiple 
serotype recombinant adenovirus vectors in a single source conq)lementing mammalian ceU hne. 
The present invention addresses and overcomes this deficiency in the art by disclosing novel 
methods for propagating multiple serotype recombuiant Ad vectors in a sin^e complanrating 

25 cell line where the required serotype-specific sequences are provided in cis. 

RTIMMARY QF THF. INVENTION 

The present invention relates to an enhanced means for propagating replication- 
defective adenovirus in an El-complementing cell line(s) where the El gene product(s) being 

30 expressed is not native to the adenovirus being propagated. Tbe method is based on Applicants' 
finding that siqiply, in cis, of a nucleic acid sequence encoding all or a portion of a het^ologous 
adenoviral E4 region which is native to a virus of die same or highly similar serotype as the El 
gene product(s) of the complementing cell line enables the growtii of adenoviral vectors of 
varying serotype in any single complementing cell line, despite the feet the cell line is not 

35 customized for Ifae particular serotype of vector being propagated. This is of particular 
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importance given that existing and settled adenoviral El-complementing cell lines (such as 
PER.C6'™ and 293) are based on one of the most prominent adenovirus serotypes (Ad5) and are 
not suited for the large-scale propagation and rescue of alternative serotypes. 

The basic stqjs involved in the propagation of adenoviral vectors m accordance 

5 with tiie methods of the instant invention are as follows: First, all or a portion of a heterologous 
adenoviral E4 region conq>rising nucleic acid sequence encoding at least open reading frame 6 
(0RF6) is inserted into a replication-defective adenoviral vector. By ^^heterologous" Applicants 
mean that the nucleic acid sequence is not native to fte viral vector being propagated, ie., not 
normally present within a virus of the same or highly similar serotype. As will be described, the 

10 adenoviral E4 region or portion thereof can be either a nucleic acid sequence encoding ORF 6 or 
any larger portion of the E4 region, and includes nucleic add comprising the complete E4 region 
with E4 promoter. The region into which the nucleic acid is incorporated is not limited, Le., die 
insertion can be made into the complete E4 region with E4 promoter or into a smaller portion 
narrowing into the 0RF6 region. Alternatively, the heterologous E4 region or portion thereof 

15 can be inserted into diflFerent areas of the genome such as the El or E3 regions. Further, the 

native E4 region or portion thereof can be deleted and replaced, or left intact This is not deemed 
a critical element of the instant invention. What is a critical element is fliat the heterologous E4 
region or portion thereof being inserted is native to a virus of the same or highly similar serotype 
as the El gene product(s) e>q>ressed by the complementing cell line. 

20 Followmg the modification of the adraoviral vector of interest, the recombinant 

adenovirus is then introduced into an adenoviral El -complementing cell line and allowed to 
propagate. The adenovirus is subsequently harvested and rescued from the complementing ceU 
line. 

The resultant virus can be studied and used in varioxis gene therapy and vaccine 
25 efforts. Thevirus, therefore, forms an important aspect of the instant invention. 

T^RIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 illustrates a transcription map for adenovirus serotype 5. The linear 
genome is divided into 100 map units as well as mto r- and 1- strands which designate the 
30 direction of transcription. Early transcription units are designated with an E and are active prior 
to viral DNA replication. Late transcription units are designated with and L and are active 
primarily sft&c DNA replication. Promoters are represented as brackets and polyadenylation 
sites as arrowheads. The tripartite leader is designated 1, 2, and 3. 

FIGURES 2A-1 through 2 A- 10 illxistrate the nucleic acid sequence for the wild- 
35 type adenovirus 35 (SEQ ID NO: 1) utilized m the Examples. 
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FIGURE 3 illustrates the homologous recombinatioii scheme utilized to recover 

pAd35AEl. 

FIGURE 4 illustrates the various configurations of the E4 regions (or portions) 

within the altOTiative serotype recombinants. 
5 FIGURE 5 illustrates the homologous recombination scheme utilized to recover 

pAd35AElAE4Ad50rf6. 

FIGURE 6 illustrates the nucleic acid sequence encoding the gag expression 
cassette (SEQ ID NO: 2). The various regions of the figure are as foUows: (1) a first underlined 
segment of nucleic acid sequence encoding the immediate early gene promoter region fi-om 

1 0 human cytomegalovirus; (2) a first segment of lowercase letters which is not underlined, which 
segment of DNA contains a convenient restriction enzyme site; (3) a region in caps which 
contains the coding sequrace of HIV- 1 gag; (4) a second segment of lowercase letters which is 
not underlined, which segment of DNA contains a convenient restriction enzyme site; and (5) a 
second underlined segment, this segment containing nucleic acid sequence encoding a bovine 

15 growth hormone polyadenylation signal sequence. 

FIGURE 7 illustrates the nucleic acid sequence encoding the SEAT expression 
cassette (SEQ ID NO: 3). The various regions of the figure are as follows: (1) a first underiined 
segment of nucleic acid sequence encoding the inamediate early gene promoter region fi:om 
human cytomegalovffus; (2) a first segment of lowercase letters which is not underlined, which 

20 segment of DNA contains a convenient restriction enzyme site; (3) a region in caps which 
contains the coding sequence of the human placental SEA? gene; (4) a second segment of 
lowercase letters which is not underlined, which segment of DNA contains a convenient 
restriction enzyme site; and (5) a second underlined segment, this segment containing nucleic 
acid sequence encoding a bovine growth hormone polyadenylation signal sequence. 

25 FIGURE 8 illustrates in vivo expression of SEAT in C3H/HeN mice using lO'^lO 

vp doses of Ad35 vectors. This experiment was designed to address any effects of E3 deletion. 
The vectors were injected intramuscularly and the levels of SEAP expression were determined 
from the serum samples. Shown are geometric means for each cohort of 5 mice. 

FIGURE 9 illustrates in vivo expression of SEAP in C3H/HeN mice using 10^10 

30 vp doses of Ad35 vectors. This experiment was designed to address any effects of Ad5 sequence 
insertion into the Ad35 genome. The vectors were injected intramuscularly and the levels of 
SEAP expression were determmed firom the serum samples. Two extra cohorts received 10"^ 10 
vp and 10^9 vp of Ad5 vector. Shown are geometric means for each cohort of 5 mice. 

FIGURES lOA-B illustrate in vivo SEAP expression usmg MRKAdS-based (A) 

35 and Ad35AEl AE4Ad50rf6-based (B) vector in rhesus macaques. Shown are the serum antigen 
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levels for individual monkeys following a single intramuscular (ijn.) injection of 10^1 1 vp 
MRKAdSSEAP (filled circles), 10^9 vp MRKAd5SEAP (open boxes) or 10^1 1 vp 
Ad35AElSEAPAE4Ad50rf6. 

FIGURE 1 1 illustrates in vivo SEAP expression in African green monkeys using 
5 Ad5- and Ad35-based vectors. Shown are the antigen levels for each animal in serum samples 

collected two days after &e treatment 

FIGURE 12 illustrates the homologous recombination scheme utilized to recover 

pAd24AEl. 

FIGURE 13 illustrates the homologous recombination sdheme utilized to recover 

10 pAd24AElAd50rf6. 

FIGURE 14 illustrates the configuration of E4 regions in the Ad24 recombinants 

goierated. 

FIGURE 15 illustrates the growth kinetics of the Ad24-based vectors in PER-C6 

cells. 

15 FIGURES 16A-1 through 16A-10 illustrate the nucleic acid sequence for wild- 

type adenovinis serotype 24 (SEQ ID NO: 5). The ATCC product number for Ad24 is VR-259. 

FIGURE 1 7 illustrates, in tabular format, gag-specific T cell responses in 
monkeys immunized with MRKAdS-HIVgag and Ad24 HIV vectors. Shown are the numbers of 
spot-forming cells per milHon, PBMC following incubation in the absence (mock) or presence of 

20 Gag peptide pool. The pool consisted of 20-aa peptide overlapping by 10 aa and encompassmg 

the entire gag sequmce. 

FIGURE 18 illustrates, in tabular format, the characterization of the gag-specific 

T cells in monkeys immunized with 10^1 1 vp of MRKAd5-HIVlgag and 
Ad24AElgagAQrf6Ad50rf5. Shown are the percentages of CD3+ T cells fliat are either gag- 
25 specific CD4+ or gag-specific CD8+ cells. These values ware corrected for mode values 
(<0.03%). 

FIGURE 19 illustrates individual anti-p24 titers (m mMU/mL) in macaques 

immunized witii gag-expressing adenovirus vectors. 

HGURE 20 iUustrates in vivo expression of SEAP in C3H/HeN mice using 10^10 
30 vp doses of Ad24 vectors. The vectors were injected intramuscularly and the levels of SEAP 
expression were determined from the serum samples. Two exfa:a cohorts received 10^10 vp and 
10-^9 vp of Ad5 vector. Shown are geometric means for each cohort of 5 mice. 

FIGURE 21 illustrates in vivo SEAP ejqjression using MRKAdS and Ad24 
vectors in rhesus macaques. Shown are the geometiric means of the SEAP levels for cohorts of 3 
35 monkeys. In bars are the standard errors of the gecmietric means. 
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FIGURE 22 illustrates a homologous recombination scheme to be utilized to 

recover 

pAd24AElAE4Ad50rf6. 

HGURE 23 iUustiates gag-specific T cell responses in rhesus macaques 
immunized following a heterologous Ad5/Ad6 prime-Ad24 boost regimen, a: Mock, no peptide: 
5 gag, 20-mer peptide pool encompassing entire gag sequence; b: Peak response after 2 or 3 doses 
of the priming vaccine; c: 3 wks prior to boost; d: 4 wks after boost; e: ND. not determined. 

FIGURE 24 illustrates, in tabular format, the percentages of CD3* T lynq)hocytes 
that are gag-specific CD8* ceUs or gag-specific CDA* cells determined after the Ad24 Boost 
hnmunization (wk 60). Numbers reflect the percentages of circulating CD3+ lymphocytes that 
10 are either gag-specific CD4+ or gag-specific CD8+ cells. Mock values (equal to or less than 

0.01%) have been subtracted. 

FIGURE 25 illustrates gag-specific T cell responses in rhesus macaques 
inununized fiillowing a heterologous Ad 24 prime-Ad5 boost regimen, a: Mock, no peptide: 
gag, 20-mer peptide pool encon]5)assing entire gag sequence; b: Peak response after 2 doses of 
1 5 the priming vaccine; c: Wk 24; d: 4 wks after boos^ e: ND, not determined. 

FIGURE 26 illustrates the homologous recombination scheme utiUzed to recover 

pAd34AElAE4Ad50rf6. 

FIGURE 27 illustrates the homologous recombination scheme utilized to recover 

pMRKAd34AElAE4Ad50rf6. 
20 FIGURES 28A- 1 to 28 A-9 illustrate a nucleic acid sequence for wild-type 

adenovirus serotype 34 (SEQ ID NO: 12). Ihe ATCC product number for Ad34 is VR-716. 

FIGURE 29 iUustrates the time course of SEAP expression using MRKAdS and 
Ad34 vectors in rhesus macaques. Data represent cohort geometric means. 

FIGURE 30 illustrates, in tabular format, T cell responses induced using 
25 MRKAdS and Ad34 vectors expressing HIV-1 gag. Data are expressed in numbers of spot- 
forming cells per milUon PBMC (SFOIO^ PBMC). "a" refers to a 20-mer peptide pool with 
10-aa ovarlap and encompassing the entire HIV-1 CAMl gag. 

FIGURE 31 iUustrates, in tabular format, ihe levels of CD4+ and CD8+ Gag- 
specific T cells in Ad34-immunized macaques at week 12. "a" refers to a 20-mer peptide pool 
30 with 10-aa overlap and encompassing the entire HIV-1 CAMl gag. 

FIGURE 32 iUustrates, in tabular format, T ceU responses induced using a 
heterologous Ad34 prime/Ad35 boost regimen in macaques, "a" refers to a 20-mer peptide pool 
with 10-aa overlap and enconq)assing the entire HIV-1 CAMl gag. 
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FIGURE 33 illustrates, in tabular fonnat, the levels of CD4+ and CD8+ Gag- 
specific T cells in Ad34 primed/Ad35 boosted macaques at week 28. "a" refers to a 20-mer 
peptide pool with 10-aa overlap and encompassing the entire HIV-1 CAMl gag. 



5 nFTAH RD DESCRIPTION OF THE INVENTION 

The present mvention details an efiBciCTit strategy for the propagation and rescue 
of alternative adenoviral serotypes utilizing available adenovirus production cell lines, nullifying 
the need to customize available cell lines for a specific serotype of interest. This is enabled by 
the incorporation of a critical E4 region uito the adenovirus to be propagated. 

1 0 The critical E4 region in the instant invention comprises, in the minimum, nucleic 

acid sequence encoding E4 ORF6 and can comprise the entire region of E4, mclusive of tiie 
promoter region. An important characteristic of the unported E4 region is that it is native to a 
virus of the same or highly sunilar serotype as the El gene product(s) (particularly ElB 55K) of 
the El-complementing cell Ime, but heterologous to {ue., non-native to a virus of the same 

15 serotype as) the adenoviral vector being propagated. As will be detailed below, the heterologous 
E4 region or portion thereof can be varied and can be inserted mto the vector backbone at 
numerous locations. 

The heterologous E4 region or portion thereof can, for instance, be a nucleic acid 
sequence encoding the entire open reading fi:ame of the non-native E4. This segment of nucleic 

20 acid sequence can, in turn, be incorporated into the '"native" entire E4 open reading fiame of the 
recipient virus. In such an embodiment, tiie promoter native to the adenovhal vector would 
drive the expression of the non-native E4 region within tiie recombinant replication-defective 
adenoviral vector. Alternatively, the nucleic acid sequence ^coding the entire open reading 
ftame can be inserted into a different region of the adenoviral vector genome, such as for 

25 example the El or E3 regions. In this latter embodiment, the native E4 region or portion thereof 

can be deleted or left intact. 

In another embodiment, the heterologous E4 region comprises a nucleic acid 
sequence encoding the entire open reading jGrame of E4 and includes a non-native E4 promoter. 
In this type of embodiment, the E4 region can be inserted into the location of the combined 
30 native E4 and E4 promote region. The non-native E4 region m this embodiment would be 
driven by expression of the non-native E4 promoter. Alternatively, the nucleic acid sequence 
encoding the entire open reading firame and the non-native E4 promoter can be inserted into a 
different region of the adenoviral vector genome, such as for example the El or E3 regions. In 
tills latter embodiment, the native E4 region or portion tiiereof can be deleted or left intact. 
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An alternative and further embodiment exists wherein the heterologous E4 region 
or portion thereof comprises nucleic acid sequence encoding a partial E4 region comprising 
ORF6 (one aspect of which is a region solely encoding ORF6). In this particular aspect of the 
invention, the heterologous non-native E4 protein can, in certain embodiments, rq)lace the non- 

5 native 0RF6 region or the entire E4-encoding region of the native virus. In the latter situation, 
ihe promoter driving expression of the non-native 0RF6 can either be the native E4 promoter or 
a heterologous, non-native promoter operatively Imked to the non-native ORF6, while in the 
latter, the expression of the non-native 0BJE6 would generally be driven by the native E4 
promoter. Alternatively, the nucleic acid sequence encoding a partial E4 region comprising ORF 

10 6 can be inserted into a different region of the adenoviral vector genome, such as for example the 
El or E3 regions. In this latter embodiment, the native E4 region or portion thereof can be 

deleted or left intact 

As one of skUl in the art can appreciate, there are various ways in which one can 

envision the si?)ply of a heterologous E4 nucleic acid sequence in cis to an adenoviral vector and 
15 thereby enable its growth based on AppUcants' novel findings herein. Moreover, as one of skill 
in the art can appreciate, either native or non-native promoters can beutilized to drive expression 
of the heterologous E4 region or portion thereof. 

Adenovirus pre-plasmids (plasmids comprising the genome of the repUcation- 
defective adenovirus with desired deletions and insertions) can be generated by homologous 
20 recombination using adenovirus backbones and an appropriate shutUe vector (designed to target- 
in specific deletions and incorporate desired restriction sites into the resultant plasmid). Shuttle 
vectors of use m this process can be generated using general methods widely understood and 
appreciated in the art, e.g.. PCR of the adenoviral terminal ends taking into account the desired 
deletions, and the sequential cloning of the respective segments into an ^ypropriate cloning 
25 plasmid. The adenoviral pre-plasmid can then be digested and transfected into the 

conq>lementing cell line via calcium phosphate co-precipitation or other suitable means. Vnus 
repUcation and amplification then occurs, a phenomenon made evident by notable cytopathic 
effect Infected cells and media are then harvested after viral rephcation is complete (generally, 

7-10 days post-transfection). 
3Q It is to be noted that various alternative adenoviral serotypes can be developed in 

accordance with the disclosed methods and, particularly, alternative adenoviral serotype vectors 
that were previously unable to be propagated or very inefBciently propagated utilizing existing 
adenoviral production cell lines based on subgroup C complementing El sequence. The various 
adenoviral vectors that can be developed in accordance with the instant methods include 
35 adenoviral vectors of subgroups A-F (for instance, serotypes of subgroups A, B (e.g.. serotypes 
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11, 14, 16, 21, 34 and 35), C (e.g., serotypes 2 and 5), D (e.g., serotypes 24, 26 and 36), E (e.g., 
serotype 4) and F. 

In preferred emboditQCTits, the various non-group C family members can be 
developed with heterologous E4 supplied from a subgroup C member such as adenovirus 

5 serotype 5. Particular embodiments of the instant invention utilize a development scheme 

wh^ein the heterologous E4 protem is derived from a wildtype adenovirus serotype 5 sequence; 
see, e.g., a viral sequence which has been deposited with the American Type Culture Collection 
("ATCC") under ATCC Deposit No. VR-5 (for which a transcription map can be found in Figure 
1). A particular example of this type of embodiment is wherein an adenovirus of subgroup B (or 

10 any non-C subgroup) comprising heterologous E4 proteins in cis from Ad5 is propagated in Ad5 
El-complementing cell Imes, for instance, PER.C6™ or 293. Applicants have, in fact, 
successfully propagated El- serotypes 10, 24, 34, and 35 via use of this particular embodiment 

One of skill in the art can readily identify alternative adenovirus serotypes (e.g., 
alternative serotypes of subgroups A, B (e.g., serotypes 1 1, 14, 16, 21, 34 and 35), C, (e.g., 

15 serotypes 2 and 5), D (e.g., serotypes 24, 26 and 36), E (e.g., serotype 4) and F) for the supply of 
the heterologous E4 protein. As long as the heterologous E4 region (or portion thereof 
comprising 0RF6) of the vector is native to a virus of the same or highly similar serotype as the 
El region of the complementing cell line, the methods of the instant invention are widely 
applicable to the propagation and rescue of adenovirus of all serotypes. In light of the present 

20 disclosure, one can readily envision, for instance, how a complementing cell line based on a non- 
subgroup C adenovirus (e.g., the Ad35 cell line of EP 1 054 064 Al) can be utilized to propagate 
a virus of an adenoviral vector of subgroup C (e.g., adenovirus serotype 5) provided that the 
appropriate nucleic acid sequence encoding an E4 protein provided in cis is native to a virus of 
the same or highly similar serotype as that of the El expressed by die complementing cell line 

25 (i.e., an Ad35 E4 protein). 

Complementing cell lines of use in the instant invention are available in the art 
and are not Umited to any specific type. The critical feature, again, is that the heterologous 
segment of E4-encoding nucleic acid sequence provided in cis to the replication-defective vector 
being propagated be native to a virus of the same or highly similar serotype as the El expressed 

30 by the complementing cell line. One aspect of the instant invention employs E 1 -complementing 
cell lines wherein the expressed El is of serotype 5; e.g., PER.C6™ and 293 cell Imes, Both 
these cell lines express the adenoviral El gene product PER.C6™ is described in Fallaux et al, 
1998 Human Gene Therapy 9:1909-1917, hereby incorporated by reference. 293 cell lines are 
described in Graham a/., 1977/. Gen. Virol 36:59-72,hereby incorporated by reference. 
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Anotber aspect of the instant invention are the adenoviral vectors of aay serotype 
falling with adenoviral subgroups A, B, C, D, E and F (for instance, alternative serotypes of 
subgroups A, B (e.g., serotypes 11, 14, 16, 21, 34 and 35), C (e.g., serotype 2), D (eg., serotypes 
24, 26 and 36), E (e.g., serotype 4) and F) which are modified to contain a non-native E4- 
5 encoding nucleic acid sequence in cis which comprises, in whole or m part, nucleic acid 

sequence encoding open reading frame 6 (0RF6). Virus in accordance with this description can 
be propagated in accordance with the above-described methods and rescued usmg any suitable 

means known in the art. 

Another aspect of the mstant invention is a vector in accordance with the instant 

1 0 mvention which comprises a heterologous passenger gene in addition to that of the heterologous 
E4 nucleic acid sequence. In specific embodiments, the passenger gene encodes an antigen. 

As one of ordinary skill m the art will appreciate, die instant methods are not 
limited by the heterologous gene that can be incorporated. The instant invention relates 
generally to a means by w^hich to propagate multiple serotypes of adenovirus in a single 

1 5 complementing cell line and the recombinant virus that make the process possible. In preferred 
embodiments, the passenger gene is incorporated into the El deletion. In alternatively preferred 
embodiments, the passenger gene is inserted in an E3-deleted region. The position of the 
passenger gene, as one of ordinary skill in the art will appreciate, can be varied according to the 
specific complementing cell utilized and the specific deletions present within the rephcation- 

20 defective adenovirus genome. 

In specific embodimwits the passmger gene can encode an HIV-1 antigen, and in 
more preferred embodiments selected firom the group consisting of genes encoding HIV-1 gag, 
pol, nef and env. In alternative embodiments, the passenger gene can be a reporter gene, such as 

secreted alkaline phosphatase (SEAP). 

25 The passenger gene preferably exists in the form of an expression cassette. A 

gene expression cassette preferably comprises (a) a nucleic acid sequence encoding a protein of 
interest; (b) a promoter operatively linked to the nucleic acid sequrace encoding the protein; and 
(c) a transcription termination sequence. The transcriptional promoter of the adenoviral vector is 
preferably recognized by an eukaiyotic RNA polymerase. In a preferred embodiment, the 

30 promoter is a "strong" or "efficient promoter. An example of a strong promoter is the 
unmediate early human cytomegalovirus promoter (Chapman et al, 1991 Nucl Acids Res. 
19:3979-3986), which is hereby incorporated by reference), in certain embodiments without 
intronic sequences. Those skaied m the art, however, will appreciate that any of a number of 
ntiier known nromoters, such as the strong immunoglobulin, or other eukaryotic gene promoters 
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may also be used, including the EFl alpha promoter, the murine CMV promoter, Rous sarcoma 
virus (RSV) promoter, S V40 early/late promoters and flie beta-actin promoter. 

The promoter may comprise a regulatable sequence such as the Tet operator 
sequence. This is extremely useful, for exan:q)le, in cases where the gene products are affecting a 
5 result other than that desired and repression is sought. 

Transcription termination sequences can also be utilized within the gene 
expression cassettes. Preferred termination sequences are, for instance, the bovme growth 
hormone terminator/polyadenylation signal (bGHpA) and the short synthetic polyA signal (SPA) 

of 50 nucleotides in length, defined as follows: 
10 AATAAAAGATCTITATTITCATTAGATCTGTGTGTTGGT-TTTTTGTGTG (SEQ ID 

NO:4). 

Further embodiments incorporate a leader or signal peptide into the transgene. A 
preferred leader is that from die tissue-specific plasminogen activator protein, tPA. 

The following non-limiting Exaniples are presented to better illustrate the 

15 invention. 

EXAMPLE 1 
Constructjon and Rescue 

An El- Ad35-based pre-adenovirus plasmid was constructed in order to 

20 determine whelher an El- Ad35 vector (a representative group B serotype) could be propagated 
in a group C El-complementing cell line. The general strategy used to recover Ad35 as a 
bacterial plasmid is illustrated in Figure 3. Cotransformation of BJ5183 bacteria with purified 
wild-type Ad35 viral DNA and a second DNA fragment termed the Ad35 ITR cassette resulted 
in the circularization of the viral genome by homologous recombination. The ITR cassette 

25 contains sequences from the right (bp 34419 to 34793) and left (bp 4 to 456 and bp 3403 to 
3886) end of the Ad35 genome {see Figures 2A-1 to 2A-10) separated by plasmid sequences 
containing a bacterial origin of replication and an Ampicillin resistance gene. The ITR cassette 
contains a deletion of El sequences from Ad5 457 to 3402 with a unique Swa I site located in the 
deletion. The Ad35 sequences m the ITR cassette provide regions of homology with the purified 

30 Ad35 viral DNA in which recombination can occur. The ITR cassette was also designed to 
contain unique restriction enzyme sites {Pme I) located at the end of the viral ITR's so that 
digestion will release the Ad35 genome from plasmid sequences. Potential clones were screened 
by restriction analysis and one clone was selected as pAd35AEl . Pre-Adenovinis plasmid 
pAd35AEl contains Ad35 sequences from 4 to 456 and bp 3403 to 34793. 
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To determme if pre-adenovirus plasmid pAd35AEl could be rescued into virus 
and propagated in a group C El complementing cell line, the plasmid was digested with /Vne I 
and transfected into a T-25 flask of PER-C6 cells using the calcium phosphate co-precipitation 
technique. Pme I digestion releases the viral genome from the plasmid sequences allowing viral 
replication to occur after entry into 293 cells. Viral cytopaliiic effect (CPE), indicating that virus 
replication and amplification is occurring, was never obsCTved. Cells and media from the 
transfection were harvested at 14 days post transfection, freeze-thawed three times, clarified by 
cMitrifiigation and used to infect new PER.C6 cells but no virus was ever amplified. Following 
multiple attempts, we have be«i unable to rescue and-anq)lify pAd35 AEl in PER.C6 cells. 



EXAMPLE 2 

Insertion of A<^ ^ Orf 6 and AdS E4 into Hie Ad5 Genome 

To refine die strategy of including AdS Orf 6 in the genome of an alternative 
SCTOtype so that propagation could take place in a AdS/group C complemeaiting cell line four 

1 5 additional strategies were devdoped. In the first strategy, the entire alternative serotype E4 
region (not inchiding tiie B4 promoter) was deleted and replaced with Ad5 Orf5. In the second 
strategy, just the alternative serotype Orf6 gene was deleted and replaced with AdS Orf5. In the 
flmd strategy, the entire alternative serotype E4 coding region (not including the E4 promoter) 
was deleted and replaced with the AdS E4 coding region (not including die AdS E4 promoter) 

20 and, in the final strategy, die entire alternative serotype E4 coding and promoter region was 
deleted and replaced wifli the AdS E4 promote and codmg region. The configuration of die E4 
regions genaated by die four strategies is diagramed in Figure 4. For each of diese strategies die 
desired pre-Adenovirus plasmid was generated by bacterial recombination. Cotransfoimation of 
BJ S183 bacteria witti purified wild-type viral DNA and die appropriately constmcted ITR 

2S cassette resulted in the circularization of die viral gaiome by homologous recombination. The 
construction of each pre-Ad plasmid, based on Ad35, is outlined below: 

To construct pAd35AElAE4Ad5C)rf5 (An Ad35 pre-Ad plasmid containing an El 
deletion and an E4 deletion substituted widi AdS OrfiS), an Ad35 ITR cassette was constricted 
containing sequences from die right (bp 3 1599 to 31913 and bp 34419 to 34793) and left (bp 4 to 

30 456 and bp 3403 to 3886) end of die Ad3S genome separated by plasmid sequences containmg a 
bacterial origin of replication and an ampicillin resistance gene. These four segments were 
generated by PGR and cloned sequentially into pNEB193, generating pNBBAd35-4. Next die 
AdS Orf6 open reading fiame was generated by PCR and cloned between Ad3S bp 3 1913 and 
34419 generating pNEBAd3S-4AdSOrf5 (die ITR cassette). PNBB193 is a commonly used 

3S commercially available cloning plasmid (New England Biolabs catif N30S 1 S) containing a 

-13- 



wo 2004/018627 



PCT/US2003/026145 



bacterial origin of replication, ampicillin resistance gene and a multiple cloning site into which 
flie PGR products were introduced. The ITR cassette contains a deletion of El sequences from 
Ad35 bp 457 to 3402 with a unique Swa I restriction site located in the deletion and an E4 
deletion from Ad35 bp 31912 to 34418 into which Ad5 Orf6 was introduced in an E4 parallel 

5 orientation. In this constnict, Ad5(>f6 e^qpression is driven by flie Ad35 E4 promoter The 

Ad35 sequences (bp 31599 to 3 1913 and bp 3403 to 3886) in the ITR cassette provide regions of 
homology witii the purified Ad35 viral DNA in which bacterial recombination can occur 
following cotransfonnation into BJ 5 183 bacteria (Figure 5). The ITR cassette was also 
designed to contain unique restriction enzyme sites (Pmel) located at the end of the vkal ITR's 

10 so that digestion will release the recombinant Ad35 genome from plasmid sequences. Potential 
clones were screened by restriction analysis and one clone was selected as 
pAd35AElAE4Ad50rf6. Pre-Adeaiovirus plasmid pAd35AElAE4Ad50rf6 contains Ad35 
sequences from bp 4 to 456; bp 3403 to bp 31913 and bp 34419 to bp 34793 with Ad50rf6 
cloned between bp 31913 and bp 34419. 

15 To construct pAd35AEl AOrf5Ad50rf5 (An Ad35 pre-Ad plasmid containing an 

El deletion and a deletion of E4 Orf6 substituted wilh Ad5 Orf6), an Ad35 ITR cassette was 
constructed containing sequences from the right (bp 31599 to 32081 and bp 32990 to 34793) and 
left (bp 4 to 456 and bp 3403 to 3886) end of the Ad35 genome separated by plasmid sequences 
containing a bacterial origin of replication and an ampicillin resistance gene. These four 

20 segments were generated by PGR and cloned sequentially into pNEB 193, generating 

pNEBAd35-10. Next the Ad5 Orf5 open reading frame was generated by PGR and cloned 
between Ad35 bp 32081 and 32990 generating pNEBAd3.5-10Ad5Orf6 (tiie ITR cassette). 
PNEB193 is a commonly used comm^ially available cloning plasmid (New England Biolabs 
cat# N3051S) containing a bacterial origin of replication, ampicillin resistance gene and a 

25 multiple cloning site into which flie PGR products were introduced The ITR cassette contams a 
deletion of El sequences from Ad35 bp 457 to 3402 with a unique Swa I restriction site located 
in tiie deletion and a deletion of E4 Orf6 from Ad35 bp 32082 to 32989 mto which Ad5 Orf6 
was introduced in an E4 parallel orientation. In this construct, Ad50rf6 e3q)ression is driven by 
tiie Ad35 E4 promoter. The Ad35 sequences (bp 3 1599 to 32081 and bp 3403 to 3886) in the 

30 YHBi cassette provide regions of homology with the purified Ad35 viral DNA in which bacterial 
recombination can occur following cotransformation into B J 5183 bacteria. The ITR cassette 
was also designed to contain unique restriction enzyme sites {Pme I) located at the end of the 
viral ITR's so that digestion will release tiie recombinant Ad35 genome from plasmid sequences. 
Potential clones were screened by restriction analysis and one clone was selected as 

35 pAd35AEl AOrf6Ad5Qrf6. Pre-Adenovirus plasmid pAd35AEl AOrf6Ad50rf6 contains Ad35 
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sequences from bp 4 to 456; bp 3403 to bp 32081 and bp 32990 to bp 34793 with Ad50rf6 

cloned between bp 3208 1 and bp 32990. 

To construct pAd35 AEl AE4Ad5E4 (An Ad35 pre-Ad plasmid containing an El 
deletion and a deletion of E4 substituted with Ad5 E4), an Ad35 ITR cassette was constructed 
containing sequences from fee ri^t (bp 31599 to 31838 and bp 34419 to 34793) and left (bp 4 to 
456 and bp 3403 to 3886) end of die Ad35 genome separated by plasmid sequences contammg a 
bacterial origin of repUcation and an anq)icUlin resistance gene. These four segments were 
generated by PGR and cloned sequentially into pNEB193, generating pNEBAd35-7. Next the 
Ad5 E4 coding region was generated by PGR and cloned betweai Ad35 bp 31838 and 34419 
generating pNEBAd35-7Ad5E4.2 (Ihe ITO cassette). PNEB193 is a commonly used 
commercially available cloning plasmid (New England Biolabs cat# N305 1 S) containing a 
bacterial origin of repUcation, ampicillin resistance gene and a multiple cloning site into which 
the PGR products were introduced. The ITR cassette contains a deletion of El sequences from 
Ad35 bp 457 to 3402 with a unique Swa I restriction site located in the deletion and an E4 
deletion from Ad35 bp 31839 to 34418 into which the Ad5 E4 coding region was introduced in 
an E4 paraUel orientation. In this construct, the Ad5 E4 region is expressed using the Ad35 E4 
promoter. Hie Ad35 sequences (bp 31599 to 3 1838 and bp 3403 to 3886) in the ITR cassette 
provide regions of homology with the purified Ad35 vual DNA in which bacterial recombination 
can occur following cotransfonnation mto BJ 5 183 bacteria. The ITR cassette was also designed 
to contain unique restriction enzyme sites (Pme 1) located at tiie end of the viral ITR's so fliat 
digestion will release Ihe recombinant Ad35 genome from plasmid sequences. Potential clones 
were screened by restriction analysis and one clone was selected as pAd35AEl AE4Ad5E4. Pre- 
Adenovirus plasmid pAd35AElAE4Ad5E4 contains Ad35 sequences from bp 4 to 456; bp 3403 
to bp 31838 and bp 34419 to bp 34793 with the Ad5 E4 coding region (Ad 5 bp 32914 to bp 

35523) cloned between bp 3 1838 and bp 34419. 

To construct pAd35AEl AE4Ad5PE4 (An Ad35 pre-Ad plasmid containmg an El 
deletion and a deletion of E4 coding region and promoter substituted with Ad5 E4 coding region 
and promoter), an Ad35 ITR cassette was constmcted containing sequences from the right (bp 
31599 to 31838 and bp 34660 to 34793) and left (bp 4 to 456 and bp 3403 to 3886) end of the 
Ad35 genome separated by plasmid sequences containing a bacterial origin of repUcation and an 
ampiciUin resistance gene. Hiese four segments were generated by PGR and cloned sequentiaUy 
into p>fEB193, generating pNEBAd35-8. Next the Ad5 E4 promoter and coding region was 
generated by PGR and cloned between Ad35 bp31838 and34660 generating pNEBAd35- 
8Ad5E4PG (tiie ITR cassette). PNEB193 is a commonly used commercially available cloning 
plasmid (New England Biolabs cafi!^ N3051S) containing a bacterial origin of repUcation, 
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ampicillin resistance gene, and a multiple cloning site into which the PGR products were 
introduced The ITR cassette contains a deletion of El sequences from Ad35 bp 457 to 3402 
with a unique Swa I restriction site located in the deletion and an E4 deletion from Ad35 bp 
3 1839 to 34659 into which the Ad5 E4 promoter and coding region was introduced in an E4 

5 parallel orientation. In this construct, the Ad5 E4 region is expressed usmg the Ad5 E4 

promoter. The Ad35 sequences (bp 3 1599 to 31838 and bp 3403 to 3886) in the ITR cassette 
provide regions of homology with the purified Ad35 viral DNA in which bacterial recombination 
can occur following cotransformation into B J 5 1 83 bacteria. The ITR cassette was also designed 
to contam unique restriction enzyme sites (Pme I) located at the end of the viral ITR's so that 

10 digestion will release the recombinant Ad35 genome from plasmid sequences. Potential clones 
were screened by restriction analysis and one clone was selected as pAd35AEl AE4Ad5PE4. 
Pre-Adenovirus plasmid pAd35AEl AE4Ad5PE4 contains Ad35 sequences from bp 4 tp 456; bp 
3403 to bp 31838 and bp 34660 to bp 34793 with the Ad5 E4 promoter and coding region (Ad 5 
bp 32914 to bp 35826) cloned between bp 31838 and bp 34660. 

15 

EXAMPLE3 

J^e,^nm of pAd35AElAE4AdSOrf6, pAd35AEl A0rf6Ad5Orfl6, DAd35 AElAE4Ad5E4 and 

pAd35AElAE4Ad5PE4 into Vims 

In order to determine if pre-adenovirus plasmids pAd35AEl AE4Ad5Qrf6, 

20 pAd35AEl AOrf6Ad50rf6, pAd35AElAE4Ad5E4 and pAd35ABl AE4Ad5PE4 could be rescued 
into virus and propagated in a group C El complementing cell line, the plasmids were each 
digested with Pme I and transfected into T-25 flasks of PER.C6 cells using the calcium 
phosphate co-precipitation technique; Cell Phect Transfection Kit, Amersham Pharmacia 
Biotech Inc. Pmel digestion releases the viral genome from plasmid sequences allowing viral 

25 replication to occur after cell entry. Viral cytopathic effect (CPE), indicating that virus 

replication and amplification was occurring, was observed for all construct When CPE was 
complete, approximately 7-10 days post transfection, the infected cells and media were 
harvested, freeze/thawed three times and the cell debris pelleted by centrifiigation. 
Approximately 1 ml of the ceU lysate was used to infect aT-225 flasks of PER.C6 cells at 80- 

30 90% confluence. Once CPE was reached, infected cells and media were harvested, 

freeze/thawed three times and the cell debris pelleted by centrifiigation. Clarified cell lysates 
were then used to infect 24ayer NUNC cell fectories of PEILC6 cells. Following complete CPE 
the virus was purified by ultracentrifugation on CsCl density gradients. In order to verify the 
genetic structure of the rescued viruses, viral DNA was extracted using pronase treatment 

35 followed by phenol chloroform extraction and elhanol precipitation. Viral DNA was Ihen 
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digested with Hindm and treated with Klenow fragment to end-label the restriction fragments 
with P33-dATP. The end-labeled restriction fragments were then size-fiiactionated by gel 
electrophoresis and visualized by autoradiography. The digestion products were compared with 
the digestion products of the corresponding pre-Adenovirus plasmid (that had been digested with 
5 PmeVHindin. prior to labeling) from which they were derived. The expected sizes were 
obsK^ed, indicating that the viruses had been successfiilly rescued. 

EXAMPLE 4 

Tn^prtinn nf an Expression 0»sRette into pAd^ '^AEl AE4Ad50rf6. pAd3SAElAOr%Ad50rf6, 
10 pAd35AElAE4Ad5E4 and pA ri^^AEl AE4Ad5PE4 

In ordo- to introduce a gag or SEAP expression cassette into the El region of the 
various Ad35 pre-Adenovirus plasmids described above (pAd35AElAE4Ad50rfi5. 
pAd35AEl AOrf6Ad50rf5, pAd35AEl AE4Ad5E4 and pAd35AElAE4Ad5PE4) bacterial 
recombination was again used. A gag expression cassette consisting of the following: 1) the 
15 immediate early gene promoter from the human cytomegalovirus, 2) the coding sequence of the 
human immunodeficiency virus type 1 (HTV-l) gag (strain CAM-1; 1526 bp) gene, and 3) the 
bovine growth hormone polyadenylation signal sequence (Figure 6), was cloned into the El 
deletion in Ad35 shuttle plasmid, plSIEBAd35-2 (a precursor to the Ad35 ITR cassettes described 
above), generating pNEBAdSSCMVgagBGHpA. pNEBAd35-2 contains Ad35 sequences from 
20 the left end of the genome (bp 4 to 456 and bp 3403 to 3886) with a unique Swal site between bp 
456 and 3403 at the position of the deletioa The gag expression cassette \ras obtained from a 
previously constructed shuttle plasmid by EcoRI digestion. FoUowing the digestion the desired 
fragment was gel purified, treated with Klenow to obtain blunt ends and cloned into the Swal 
site in pNEBAd35-2. This cloning step resulted in the gag expression cassette being cloned into 
25 the El deletion between bp 456 and 3403 in the El paraUel orientation. The shutUe vector 
containing the gag transgene was digested to generate a DNA fragment consisting of the gag 
expression cassette flanked by Ad35 bp 4 to 456 and bp 3403 to 3886 and the fragment was 
purified after electiwphoresis on an agarose gel. Cotransformation of B J 5 1 83 bacteria with tiie 
shuttie vector fragment and one of the Ad35 pre-Ad plasmids (pAd35AElAE4Ad50rf5, 
30 pAd35AElAOrf6Ad50rf6, pAd35AElAE4Ad5E4, pAd35AElAE4Ad5PE4), linearized in tiie El 
region by digestion with Swa I, resulted in the generation of correspondmg Ad35 gag-containing 
pre-Adenovims plasmids (pAd35AElgagAE4Ad50rf6, pAd35AElgagAOrf6Ad50rf6, 
pAd35AElgagAB4Ad5E4, and pAd35AElgagAE4Ad5PE4) by homologous recombination. 
Potential clones were screened by restriction analysis. 
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A similar strategy was used to generate Ad35 pre-Ad plasmids containing a SEAP 
expression cassette. In this case a SEAP expression cassette consisting of: 1) the immediate 
early gene promoter from the human cytomegalovirus, 2) the coding sequence of the human 
placental SEAP gene, and 3) the bovine growth honnone polyadenylation signal sequence 
5 (Figure 7) was cloned into the El deletion in Ad35 shuttle plasmid, pNEBAd35-2, generating 
pNEBAd35CMVSEAPBGHpA, The SEAP cjqMression cassette was obtained from a previously 
constructed shuttle plasmid by EcoRI digestion. Following the digestion the desired fragment 
was gel purified, treated with Klenow to obtain blunt ends and cloned into the Swal site m 
pNEBAd35-2. The transgene was then recombined into flie various Ad35 backbones generating 
10 pAd35AElSEAPAE4Ad50rf6, pAd35AElSEAPAOrf6Ad50rf6, pAd35AElSEAPAE4Ad5E4, 
andpAd35AElSEAPAE4Ad5PE4 as described above for the gag transgene. 
All pre-Ad plasmids were rescued into virus and e)q>anded to prepare CsQ purified stocks as 
described above. 

15 EXAMPLE 5 

In vivo Tran s pene Expression 

A. Immunization 

Female mice were between 4-10 wedcs old The total dose of each vaccine was 
20 suspended in 0. 1 mL of buffer. The vectors were given to both quadriceps of each animals with 
a volume of 50 jiL per quad and using 0.3-mL 28G1/2 insulin syringes (Becton-Dickinson, 
Franklin Lakes, NJ). The rhesus macaques and Afiican green monkeys were between 2-5 kg in 
weight For the primates, the total dose of each vaccine was suspended in I mL of buffe. The 
monkeys were anesthetized (ketamine/xylazine mixture) and the vaccines were delivered i.m. in 
25 0,5-mL aliquots into two muscle sites using tuberculin syringes (Becton-Dickinson, Franklin 
Lakes, NJ). Serum samples were collected at defined intervals and stored firozen until the assay 
date. All anunal care and treatment were in accordance with standards approved by the 
Institutional Animal Care and Use Committee according to the principles set forth in the Guide 
for Care and Use of Laboratory Animals, Institute of Laboratory Animal Resources, National 
30 Research Council. 

R SEAP Assay 

Serum samples were analyzed for circulating SEAP levels using TROPDC 
phospha-light chemiluminescent kit (Applied Biosystems Inc). Duplicate 5 pL aliquots of each 
3 5 serum were mixed with 45 jiL of kit-supplied dilution buffer in a 96-well white DYNEX plate. 
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Serially diluted solutions of a human placental alkaline phosphatase (Catalog no. M5905, Sigma, 
St Louis, MO) in 10% naive monkey or mouse serum served to provide the standard curve. 
Endogenous SEAP activity in the samples was inactivated by heating the well for 30 nunutes at 
65 **C. Enzymatic SEAP activities in the samples were determined following the procedures 
5 described in the kit. Chemiluminescence readings (in relative light units) were recorder using 
DYNEX luminometer. RLU readings are converted to ng/mL SEAP using a log-log regression 
analyses. 

C. Rodent Results 

10 In the first mouse e3q)eriment, cohorts of 5 C3H/HeN mice were given sin^e 

intramuscular injections of one of the following vectors: (1) lO'^lO vp 
Ad35AElSEAPAE4Ad50rf6; (2) 10^10 vp Ad35AElSEAPAE3AE4Ad50rf6; or (3) lO^lO vp 
Ad35AElSEAP. Serum samples prior to and after the mjection were analyzed for circulating 
SEAP activities and the results are shown in Figure 8. Results mdicate that (1) the Ad35 

1 5 constructs are all capable of expressing the SEAP Iransgene and that (2) the introduction of 
Ad50rf6 sequence where the deleted Ad35E4 was did not significantly affect titie transgene 
expression relative to Ad35AElSEAP. Ad35AElSEAPAE3AE4Ad50rf6 also yielded a similar 
expression profile as Ad35AElSEAP. Ihe levels of SEAP in the serum dropped after day 2 and 

were at background levels by day 1 2. 
20 The second mouse experiment evaluates the effect ofafiillAd5E4 replacement 

instead of an Ad50rf6 substitution for the Ad35 E4 cassette. Here, cohorts of 5 C3H/HeN mice 
were given single intramuscular injections of one of the foUowmg vectors; (1) 10^10 vp 
MRKAd5-SEAP; (2) 10^9 vp MRKAd5-SEAP; (3) 10^10 vp Ad35AElSEAPAE4Ad50rf5; (4) 
10^10 vp Ad35AElSEAPAE4Ad5E4; or (5) 10^10 vp Ad35AElSEAPAE4Ad5PE4. The 
25 introduction of Ad5E4 or Ad5PE4 resulted in comparable if not, sHghtly improved expression 
levels compared to the vector witii the Ad50rf6 insertion (Figure 9). The peak levels for the 
Ad35 constructs are lower than those produced by AdSSEAP (at least 10-fold). 

D. Primate Results 

30 Cohorts of 3 rhesus macaques were given single intramuscular injections of one 

of the following vectors: (1) lO'^l 1 vp MRKAd5-SEAP; (2) 10^9 vp MRKAd5-SEAP; or (3) 
10^1 1 vp Ad35AElSEAPAE4Ad50rf6. Serum samples prior to and after the injection were 
analyzed for circulating SEAP activities and the results for the individual monkeys are shown in 
Figures 1 OA-B. Results indicate that the peak level of SEAP product produced by the alternative 

35 adenovirus serotype was lower than but were within 3-fold of that of MRKAdSSEAP at the same 
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high dose level of 10^1 1 vp. The levels observed firom the Ad35 vector were about 50-fold 
higher than those observed using 10^9 vp of MRKAdSSEAP. The levels of SEAP in flie serum 
dropped after day 10 and were close to backgroimd as early as d^ 15. 

A separate experiment using Afiican green monkeys was conducted to examine 

5 the effect of the additional E3 deletion or the full Ad5E4 substitution on in vivo gene expressioa 
In here, cohorts of 2-3 Afirican green macaques were given single intramuscular injections of one 
of the foUowing vectors: (1) 10^11 vp MRKAd5-SEAP; (2) lO^lO vp MRKAd5-SEAP; (3) 10^9 
vp MRKAdS-SEAPj (4) 10^10 vp Ad35AElSEAPAE4Ad50rjK; (5) 10^10 vp 
Ad35AElSEAPAE3AE4Ad50rf6; or (6) 10^10 vp Ad35AElSEAPAE4Ad5E4. Results (Figure 

10 11) indicate that the peak levels of SEAP product produced by Ad35AEl SEAPAE3AE4Ad50rf6 
and Ad35AElSEAPAE4Ad5E4 were comparable if not, slightly improved compared to 
Ad35 AEl SEAPAE4Ad50rf6. 

EXAMPLE 6 
15 In vivo ImniuDOgenicitv 

A TmrmmirAtiOTi 

Cohorts of 3-6 animals were given intramuscular injections at wk 0 and wk 4 of 
either of the following constructs: (1) 10^1 1 vp MRKAdS-fflVl gag; or (2) 10^11 vp of 

20 Ad35AElgagAE4Ad50rf6. Rhesus macaques were between 3-10 kg in weight. In aU cases, the 
total dose of each vaccine was suspended in 1 mL of buffer. The macaques were anesthetized 
(ketamine/xyla2dne) and the vaccines were deUvered i.m. in 0.5-niL aliquots into both deltoid 
muscles using tuberculin syringes (Becton-Dickinson). Sera and peripheral blood mononuclear 
cells (PBMC) were prepared from blood samples coUected at several time points during the 

25 immunization regimen. All animal care and treatment were in accordance with standards 
approved by the Institutional Animal Care and Use Committee according to the principles set 
forth m the Guide for Care and Use of Laboratory Animals, Institute of Laboratory Animal 
Resources, National Research Council. 

30 ELISPOT Assay 

The BFN-y ELISPOT assays for rhesus macaques were conducted following a 
previously described protocol (Allen et al, 2001 Firo/. 75(2):738-749), with some 
modifications. For antigen-specific stimulation, a peptide pool was prepared from 20-aa 
peptides that encompass tiie entire HIV-1 gag sequence with 10-aa overlaps (Synpep Corp., 

35 Dublin, CA). To each well, 50 [iL of 2-4 x lO^ peripheral blood mononuclear cells (PBMCs) 
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were added; the ceUs were counted iising Beckman Coulter Z2 particle analyzer with a lower 
size cut-offset at 80 femtoliters ("fL"). Either 50 pL of media or the gag peptide pool at 8 
Hg/mL concentration per peptide was added to the PBMC. The samples were incubated at STC, 
5% CO2 for 20-24 hrs. Spots were developed accordingly and the plates were processed using 
5 custom-built imager and automatic counting subroutine based on the ImagePro platform (Silver 
Spring, MP); the counts were normalized to 10^ cell input. 

C. Intracellular Cy tnlHrift StainiTip 

To 1 ml of 2 X 10^ PBMC/mL m complete BO^MI media (in ITxlOOmm round 

10 bottom polypropylene tubes (Sarstedt, Newton, NC)), anti-hCD28 (clone L293, Becton- 

Dickinson) and anti-hCD49d (clone L25, Becton-Dickinson) monoclonal antibodies were added 
to a final concentration of 1 jig/mL. For gag-specific stimulation, 10 mL of the peptide pool (at 
0.4 mg/mL per peptide) were added. The tubes were incubated at 37 °C for 1 hr., after which 20 
fiL of 5 mg/mL of brefeldin A (Sigma) were added. The cells were incubated for 16 hr at 37 "^C, 

15 5% CO2, 90% humidity, 4 mL cold PBS/2%FBS were added to each tube and the cells were 
pelleted for 10 min at 1200 ipm- The cells were re-suspended in PBS/2%FBS and stained (30 
min, 4 °C) for surface markers using several fluorescent-tagged mAbs: 20 ^L per tube anti- 
hCD3-APC, clone FN- 18 (Biosource); 20 \xL anti-hCDS-PerCP, clone SKI (Becton Dickinson, 
Franklin Lakes, NJ); and 20 yL anti-hCD4-PE, clone SK3 (Becton Dickinson). Sample handhng 

20 firom this stage was conducted in the dark. The cells were washed and incubated in 750 ^xL 
IxFACS Perm buffer (Becton Dickinson) for 10 min at room temperature. The cells were 
peUeted and re-suspended in PBS/2%FBS and 0.1 \ig of FITC-anti-hlFN-y, clone MD-1 
(Biosource) was added. After 30 min incubation, the cells were washed and re-suspended in 
PBS. Samples were analyzed using all four color channels of the Becton Dickmson 

25 FACSCalibur instrument. To analyze the data, the low side- and forward-scatter lymphocyte 
population was initially gated; a common fluorescence cut-off for cytokine-positive events was 
used for both CD4'^ and CD8^ populations, and for both mock and gag-peptide reaction tubes of 
a sample. 

30 D. Results 

PBMCs collected at regular 4-wk mtervals were analyzed in an ELISPOT assay. 
Results (Table 1) mdicate that the Ad35AElgagAE4Ad50rf6 is able to induce in non-human 
primates significant levels of gag-specific T cells. After a single dose (wk 4), the Ad35-induced 
responses were about 5-fold lower than that of MRKAd5-HIVl gag. After the second dose (wk 
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8), the responses between both cohorts were comparable; the differences became pronounced in 
the succeeding time points. 

Table 1. Gag-specific T cell response in monkeys immunized with MRKAdS-HIVlgag and 
5 Ad3 5 AEl gagAE4Ad50r£5. Shown is the number of spot-forming cells per million PBMG 
following incubation in the absence (mock) or presence of Gag H peptide pool. The H pool 
consisted of 20-aa peptide overlapping by 10 aa and encompassing the entire gag sequence. 



Grp 


Vaccina 
WkO,Wk4 


Monkey 
ID 


Pre 


W1c4 


Wk8 


Wk12 


Wk 


16 


Mock 


GagH 


fi/locfc 


Gag H 


Mock 


GagH 


Mock 


GagH 


Mock 


GagH 


1 


MRKAcf5-HIV1 gag 
10*11 VP 


00CO16 
00CO34 
0OCO58 


1 
0 
4 


5 
4 
4 


13 

5 

3 


1025 

219 

1086 


0 
5 
0 


824 
404 
440 


3 
0 
0 


753 
491 
439 


1 
1 
0 


533 
350 
599 


2 
3 


Ad35AE1gagAE4Ad50rf6 
lOMIvp 

Naivo 


000045 
00D067 
O0DO68 
OOD054 
00D075 
00D073 

O0DOB7 


1 
1 
1 
3 
3 
14 

1 


1 

4 

4 

15 

5 

26 

1 


3 

5 

10 

10 

18 

1 

3 


168 

89 

34 

195 

275 

241 

3 


5 
0 
5 
0 
13 
3 

8 


645 
103 
365 
501 
716 
485 

54 


4 
0 
3 
3 
3 
3 

3 


178 

76 

143 

350 

158 

278 

5 


0 
0 
0 
0 
0 
0 

3 


91 

19 

S5 

124 

103 

148 

1 



10 

Intracellular IFN-y staining analyses of PBMC collected at wk 8 suggest that the Ad35-based 
vaccine is able to induce both HIV-specific CD4+ and CD8+ T cells (Table 2). 

Table 2. Characterization of the gag-specific T cells in monkeys immunized with MRKAdS- 
15 HIVlgag and Ad35AElgagAE4Ad5C>rf6. Shown are the percentages of CD3+ T cells that are 
either gag-specific CD4+ or gag-specific CD8+ cells. These values were corrected for mock 
values (<0.02%). 



Grp 


Vaccine 


Monkey 


Wk8 


WkO, Wk4 


ID 


%CD4+CD3+ 


%CD8-K:D3+ 


1 


MRKAd5-HlV1 gag 


O0C018 


0.08 


0.37 


lO'^H vp 


O0C034 


0.09 


0.06 




O0G058 


0.03 


0.21 


2 


Ad35AE1 gagAE4Ad50rf6 


O0D045 


0.06 


0.08 




10^11 vp 


OOD067 


0.02 


0.02 




00D06B 


0.15 


0.02 






OOD054 


0.05 


0.08 






O0D075 


0.08 


0.05 






O0D073 


0.09 


0.06 



In a separate experiment, 3 different Ad35 constructs expressing HIV-1 gag were evaluated for 

their immunogenicity in macaques. Here, cohorts of 3 macaques were given immunizations at 

wk 0 and4 of either of the following vectors: (1) 10^10 vp Ad35AElgagAE4Ad50rf5; (2) 10^10 
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vp A(i35AElgagAE3AE4Ad50if6; or (3) 10^10 vp Ad35AElgagAE4Ad5E4. The levels of T 
cell immunity induced by all 3 vectors were comparable at this stage (Table 2), suggesting that 
llie additional E3 deletion or full Ad5E4 substitution does not appear to impair the immunogenic 
properties of the vector. 

Table 3. Gag-specific T cell re^onse in monkeys immunized with several Ad35AEl AE4-based 
vectors. Shown is the number of spot-forming cells per miUion PBMC foUowing incubation in 
the absence (mocKO or presence of Gag H peptide pool. Hie H pool consisted of 20-aa peptide 
ovCTlapping by 10 aa and enconq)assing the entire gag sequence. 
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Grp 


Vaccine 
WkO.Wk4 


Monkey 
ID 


Pre 


Wk4 


Wk8 


Mock 


GagH 


Mock 


GagH 


Mock 


GagH 


1 


Ad35AE1 gagAE4Ad50rf6 
lO^IOvp 


00C047 
00C157 
00C078 


4 
8 

3 


1 
5 
1 


0 
1 
0 


20 
81 
46 


0 
1 
4 


189 
833 
349 


2 


Ad35AE1 gagAE3AE4Ad50rf6 
lO'^IOvp 


00C091 
00C122 
00D177 


1 
3 
3 


1 
0 
3 


1 
0 
1 


118 

31 

45 


3 
1 
1 


315 
138 
64 


3 


Ad35AE1 gagAE4Acl5h4 
10*10vp 


OOD018 
00D046 
O0D063 


3 
8 
3 


19 
5 
4 


29 
1 
0 


120 

21 

63 


23 
10 
4 


193 
143 
371 


Naive 


none 


00D363 


0 


5 


ND 


ND 


0 


0 



EXAMPLE 7 

nnnstmction and R ftgciie of pAd24AEl. 

15 An El- Ad24-based pre-adenovirus plasmid was constructed in order to 

detemiine whether an El- Ad24 vector (a representative group D serotype) could be propagated 
in an Ad5/group C El -complementing cell line. Since at the time the vector construction was 
initiated die complete sequence of Ad24 (see Figures 16A-1 through 16A-10; subject of 
copending appUcation serial no. 60/455, 312, filed March 17. 2003) was unknown we took 

20 advantage of some sequence homology between Ad24 and Adl7. The general strategy used to 
recover Ad24 as a bacterial plasmid is illustrated in Figure 12 and described below. 
Cotransforaiation of B J5 1 83 bacteria with purified wild-type Ad24 viral DNA and a second 
DNA fiagment termed the Adl7 ITR cassette resulted in the circularization of the viral genome 
by homologous recombination. The ITR cassette contains sequences fix>m the right (bp 34469 to 

25 35098) and left (bp 4 to 414 and bp 3373 to 4580) end of die Adl7 genome (Accession No. 
AF 108 105) separated by plasmid sequaices containing a bacterial origin of repUcation and an 
An5>icillin resistance gene. The ITR cassette contains a deletion of El sequences fi-om Adl7 
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(bp 415 to 3372) with a unique Swa I site located in the deletioa The Adl7 sequences in the 
ITR cassette provide regions of homology with the purified Ad24 viral DNA in which 
recombination can occur. The ITR cassette was also designed to contain unique restriction 
enzyme sites (Pme I) located at the end of the viral ITR's so that digestion will release the Ad24 
5 genome from plasmid sequences. Potential clones were screened by restriction analysis and one 
clone was selected as pAd24AEl. pAd24AEl contains Adl7 sequences from bp 4 to 414 and 
from bp 3373 to 4580, Ad24 bp 4588 to 34529, and Adl7 bp 34469 to 35098 (bp numbers refer 
to the wt sequence for both Adl7 and Ad24). PAd24AEl contains the coding sequences for all 
Ad24 virion structural proteins that constitute its serotype specificity. This approach can be used 

10 to circularize any group D serotype into plasmid form which has sufficient homology to Adl7. 

To determine if pre-adenovirus plasmid pAd24AE 1 could be rescued irito virus 
and propagated in a group C El complementing cell line, the plasmid was digested with Pme I 
and transfected into a 6 cm dish of 293 cells using the calcium phosphate co-precipitation 
technique. Pme I digestion releases the viral genome from the plasmid sequences allowing viral 

15 replication to occur aft^ entry into 293 cells. Viral cytopathic effect (CPE), indicating that virus 
replication and amplification is occurring, was very slow to arise. Following multiple attempts, 
we were successful at rescuing and amplifying Ad24AE 1 but the virus grew to lower titers and 
took more passages to amplify than a similar Ad5 based vector. In order to vrafy the genetic 
structure of the virus, viral DNA was extracted using pronase treatment followed by phenol 

20 chloroform extraction and ethanol precipitation. Viral DNA was then digested with Hin6ni and 
treated with Klenow fragment to end-label the restriction firagments with P33-dATP. The end- 
labeled restriction fragments were then size-fractionated by gel electrophoresis and visualized by 
autoradiography. The digestion products were cofnpared with the digestion products from the 
pre-plasmid (that had been digested with Pmel/Hindm prior to labeling). The expected sizes 

25 were observed, indicating that the virus had been successfully rescued. 

EXAMPLE 8 

Insertion of Ad5 Qrf 6 mto the El region of Ad24 

In order to determine if the insertion of Ad5 E4 Orfl5 into the Ad24 genome 

30 would allow more efficient propagation in a group C El complementing cell line we constructed 
an Ad24 based pre-adenovirus plasmid containing Ad5 OrfB in the El region. In order to 
introduce Ad5 Or£6 in to the El region of pAd24AEl, bacterial recombination was used. An 
Ad5 Oif6 transgene consisting of the Ad5 Orf6 coding region flanked by the HCMV promoter 
and pA was cloned into the El deletion in an Adl7 shuttle vector (a precursor to the Adl7 ITR 

35 cassette). The Ad5 Orf6 transgene was cloned between bp 414 and 3373 in ttie El anti-parallel 
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orientation. ITie shuttle vector containing the Ad5 Orf5 transgene was digested to generate a 
DNA fragment consisting of the transgene flanked by Adl7 sequences (bp 4 to 414 and bp 3373 
to 4580) and the fragment was purified after electrophoresis on an agarose gel. 
Cotransfbnnation of BJ 5183 bacteria with liie shuttle vector fragment and pAd24AEl, which 
5 had been linearized in the El region by digestion with SwdU resulted in the generation of 

pAd24AElAd50r^ by homologous recombination (Figure 13). Potential clones were screened 
by restriction analysis and one clone was selected as pre-ademovirus plasmid 

pAd24AElAd50rfS. 

In order to determine if pre-adenovnns plasmid pAd24AElAd50rfi5 could be 

10 rescued into virus and propagated in an Ad5/groap C El complementing cell line, 

pAd24AElAd50rfi5 was digested with Pme I and transfected mto a 6 cm dish of 293 cells using 
the calcium phosphate co-precipitation technique. Pmel digestion releases the viral genome 
from plasmid sequences aUowing viral replication to occur after entry into 293 cells. Once 
complete viral cytopatiiic effect (CPE) was observed at approximately 7-10 days post 

15 transfection, the infected cells and media were freeze/thawed three times and tiie cell debris 

pelleted. The virus was amplified in two additional passages in 293 cells and then purified from 
the final infection by ultracentri&gation on CsCl density gradients, hi order to verify the genetic 
structure of the virus, viral DNA was extiacted using pronase treatment foUowed by phenol 
chloroforin extraction and ethanol precipitation. Viral DNA was then digested witii Hindm and 
20 treated with Klenovsr fragment to end-label tiie restriction fragments with P33-dATP. The end- 
labeled restriction fragments were then size-fractionated by gel electrophoresis and visualized by 
autoradiography. The digestion products were compared with tiie digestion products from the 
pre-plasmid (that had been digested witii PmellHindSL prior to labeling). The expected sizes 
WCTC observed, indicating that tiie virus had been successfully rescued. 

25 

EXAMPLE 9 

Tncftrrinn of Ad5 Orf 6 into the E4 r epion of Ad24 

To refine Ae strategy of including Ad5 Orf6 in tiie gaiome of an alternative 
serotype so tiiat propagation could take place m an Ad5/group C complementing cell line two 

30 additional strategies were developed. In the first strategy, tiie entire alternative serotype E4 
region (not including flie E4 promoter) was deleted and replaced witii Ad5 Orf6. In tiie second 
strategy, just tite alternative serotype Orf6 gene was deleted and replaced witii Ad5 Orf6. The 
configuration of tiie E4 regions generated by flie two strategies is diagramed in Figure 14. For 
each of tiiese strategies tiie desired pre- Adenovirus plasmid was generated by bacterial 

35 recombination. Cotransformation of BJ 5 183 bacteria witii pAd24AOrf6BstZ17I and flie 
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appropriately constructed A(124 E4 shuttle plasmid resulted in the generation of the desked Ad24 
based pre-Ad plasmid. PAd24AOrf5BstZ17I, a derivative of pAd24AEl, was constructed so that 
the E4 region in the Ad24 pre-Ad plasmid could be easily modified using bacterial 
recombination. PAd24AQrf6BstZ17I contains a deletion in the E4 region from Ad24 bp 32373 to 

5 bp 33328 with a unique BstZYJl site located at the position of the deletion. The complete 
sequence of pAd24AOrf5BstZ17I consists of Adl7 sequences from bp 4 to 414 and from bp 
3373 to 4580, Ad24 bp 4588 to 32372 and from 33329 to 34529, and Adl7 bp 34469 to 35098 
(bp numbers refer to the wt sequence for both Adl7 and Ad24). 

To construct pAd24AEl AE4Ad5C)rf6 (An Ad24 pre-Ad plasmid containing an El 

10 deletion and a deletion of E4 substituted with Ad5 Orf6), an Ad24 E4 shutfle plasmid was 

constructed by digesting pAd24AEl with Pmel and BsrGl and cloning the restriction fragment 
representing the E4 region (bp 31559 to bp 35164) into pNEB193, generating pNEBAd24E4. 
PNEBAd24E4 was then digested with Accl and EcoW. to remove the E4 coding sequences and 
ligated with an oligo designed to contain BgUi and Xhol sites (underlined) (5' 

15 ACTCGAGATGTATAGATCT (SEQ ID NO: 6); 5' CTAQATCTATACATCICGAG (SEQ ID 
NO: 7)), generating pNEBAd24AE4. PNEBAd24AE4 was then digested with BgUl and Xhol 
and ligated with the Ad5 Orf5 gene, which was PGR amplified, generating 
pNEBAd24AE4Ad50rf5. The PGR primers used to amplify the Ad5 Orf6 gene (5' 
GCACAGMCTTTGCTTCAGGAATATG (SEQ ED NO: 8); 5' 

20 GAGAACTCGAGGCCTACATGGGGGTAGAG (SEQ ID NO: 9)) were designed to contain 
5g/n and A3k?I sites (underlined above) for Ugation with the pNEBAd24D^ fragment In the 
final step pNEBAd24AE4Ad50rf5 E4 shuttle plasmid was digested with Pvul and Pmel, the 
restriction fiugments were size fractionated by agarose gel electrophoresis and the desired 
fragment containing Ad50rf6 flanked by Ad24 sequences was gel purified. Cotransformation of 

25 B J 5 1 83 bacteria with E4 shuttle fragment aud pAd24AOrf6BstZ 1 71, which had been linearized 
in the E4 region by digestion with BstZVTl, resulted in the generation of pAd24AEl AE4Ad50rfi6 
by homologous recombination. Potential clones were screened by restriction analysis and one 
clone was selected as pre-adenovirus plasmid pAd24 AE 1 AE4 Ad50rf6. 

To construct pAd24AEl AOrf5Ad50rf5 (An Ad24 pre-Ad plasmid containing an 

30 El deletion and a deletion of E4 Orf6 substituted with Ad5 Orf6), an Ad24 E4 shuttle plasmid 
was constructed in which the Ad24 Orf6 gene was replaced by Ad5 Orf6. To do this the EcdRX 
restriction fragment representing bp 32126 to bp 33442 of the Ad24 genome (encompassing the 
E4 Orf6 coding region), was subcloned into the £coRI site in pNEB193, generating 
pNEBAd240rf5. In order to delete the E4 Orf5 gene in pNEBAd240rf6 and replace it with Ad5 

35 Orf6, pNEBAd240rf6 was digested with Styl and treated with Klenow to blunt the ends and thm 
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digested wifli to EagL The desired pNEBAd240rfl5 fiagment was then ligated with a PGR 
product representing the Ad5 Orf5 gene from Ad5 bp 33 193 to bp 24125, generating 
pNEBAd24AOrf6Ad50rf6. The PGR primers used to generate IheAdS Or£6 fragment 
(S'CGAGA^GCCGAGGCAGATCTGTTTG (SEQ ID NO: 10); 

5 S'GAAG TCCCGGGC TAGATGGGGGTAG (SEQ ID NO: 1 1)) were designed to contain Eagl 
and Smal sites (underlined above) for Ugation with flie pNEBAd240ra fragment In the final 
step pNEBAd24AOrf6Ad50rfi5 was digested with EcdBl, the restriction fragments were size 
fractionated by agarose gel electrophoresis and the desired fragment containing AdSOrfS flanked 
by Ad24 sequences was gel purified. Gotransfonnation of BJ 5183 bacteria witii the EcoBl 

10 fragment and pAd24AOrf6BstZ17I, which had been linearized in Uie E4 region by digestion with 
BsOm, resulted in the generation of pAd24AElAOrf6Ad50rf6 by homologous recombination. 
Potential clones were screened by restriction analysis and one clone was selected as pre- 
adenovinis plasmid pAd24AEl AOrfiSAd50rf6. 

15 EXAMPLE 10 

T^escue of DAd24AElAE4Ad'SOrf6. DAd24AEl AOrf6Ad5Q rfl5. into Virus 

In order to determine if pre-adaiovirus plasmids pAd24AElAE4Ad50rf6, 
pAd24AElAOrf6Ad50rf6, could be rescued into virus and propagated ina groiq) G El 
complementing cell line, the plasmids were each digested with Pine I and tiansfected into T-25 

20 flasks of PERG6 cells using the calcium phosphate co-precipitation technique; (GeU Phect 

Transfection Kit, Amersham Pharmacia Biotech Inc.). Pmel digestion releases the viral genome 
from plasmid sequences allowmg viral replication to occur afro: cell entry. Viral cytopathic 
effect (CPE), indicating that virus rephcation and amplification was occurring, was observed for 
botii constmcts. When GPB was complete, ^proximately 7-10 days post transfection, the 

25 infected cells and media were harvested, freeze/thawed three times and the cell debris peUeted 
by centriftigation. Approximately 1 ml of the cell lysate was used to infect T-225 flasks of 
PERC6 cells at 80-90% confluence. Once CPE was reached, infected cells and media were 
harvested, fieeze/thawed tbi«e times and die cell debris pelleted by cenfrifiigation. Clarified cell 
lysates were then used to infect 2-Iayer NUNC cell fectories of PER-C6 cells. FoUowing 

30 conrolete CPE the virus was purified by ultracentrifiigation on CsCl density gradients. In order 
to verify the genetic structure of the rescued viruses, viral DNA was extracted using pronase 
treatment followed by phenol chloroform extraction and ethanol precipitation. Viral DNA was 
thai digested with Hin6SL and treated with Klenow fragment to end-label the restriction 
fragments with P33-dATP. The end-labeled restriction fragments were then size-fractionated by 

35 gel electrophoresis and visualized by autoradiography. The digestion products were con5)ared 
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with the digestion products of the conesponding pre-Adenovinis plasmid (that had been digested 
with PmeVHindm prior to labeling) from which fhey were derived. The expected sizes were 
observed, indicating that the viruses had been successfully rescued. 

5 EXAMPLE 11 

Comparison of the Growth Kinetics of Ad24 base d vectors. 

In order to compare the growth kinetic of Ad24AEl , Ad24AEl Ad50rf6, 
Ad24AEl AE4Ad50rf6 and Ad24AEl AOrfi6Ad50rf6 one step growth curves were preformed 
(Figure 15). PHLC6 cells in 60 mm dishes were infected at 1 vp per cell with eidio- Ad24AEl, 

10 Ad24AElAd50rfi5, Ad24AEl AE4Ad50rf5 or Ad24AElAC)rB5Ad50rf6. Cells and media were 
then harvested at various times post infection, freeze Aawed three times and clarified by 
centrifiigation. The amount of virus present in the saniples was detramined by quantitative PGR 
and is illustrated in Figure 15. This study demonstrates that Ad24 vectors that incorporate Ad5 
Orf5 have a distinct growth advantage over Ad24AEl m PER.C6 cells. The instant invention 

15 can be practiced with recombinant Ad24 vectors absent a heterologous Orf 6 region where the 
El-complementing cell line expresses an Ad24 El region or, alternatively. El and E4 regions of 
the same serotype (such as Ad5E l/E4-expressing cell lines). 

EXAMPLE 12 

20 Insertion of an F-xnression Cassette into pAd24AE1 AE4Ad50rf6. pAd24AElAOrfl5Ad50 rf6, 

Jn Older to introduce a gag or SEAP raqpression cassette (see Figures 6 and 7, 
respectively) into die El region of the Ad24 pre-Adenovirus plasmids described above 
(pAd24AElAE4Ad50rf6, pAd24AElAOrf6Ad50rf5) bacterial recombination was used. A gag 
ejqoession cassette ctmsisting of tiie following: 1) the immediate early gene promoter &om the 

25 human cytomegalovirus, 2) the coding sequence of the human immunodeficienqr virus type 1 
(HTV-l) gag (strain CAM-1; 1526 bp) gene, and 3) the bovine growth hormone polyadenylation 
signal sequence, was cloned into the El deletion in Adl7 shutde plasmid, pABSAdl7-3, 
generating pABSAdlTHCMVgagBGHpA The ITR cassette contains sequences from the right 
(bp 34469 to 35098) and left (bp 4 to 414 and bp 3373 to 4580) end of tihe Adl7 genome 

30 separated by plasmid sequences containing a bacterial origin of replication and an Ampicillin 
resistance gene. The ITR cassette contains a deletion of El sequences from Adl7 (bp 415 to 
3372) with a unique Swa I site located in the deletion. The gag expression cassette was obtained 
from a previously constructed shutfle plasmid by EcoRI digestion. Following the digestion the 
desired fragment was gel purified, treated with Klenow to obtain blunt ends and cloned into the 

35 Swal site in pABSAdl7-3. This cloning step resulted in the gag e:q>ression cassette being 
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cloned into the El deletion between bp 414 and 3373 in the El parallel orientation. The shutUe 
vector containing the gag transgene was digested to generate a DNA fragment consisting of the 
gag expression cassette flanked by Adl7 bp 4 to 414 and bp 3373 to 4580 and the fiagment was 
purified after electrqphoresis on an agarose geL Cotransformation of BJ 5 183 bactMia with the 

5 shuttle vector fragment and one of the Ad24 pre-Ad plasmids (pAd24AEl AE4Ad50rf6, 

pAd24AEl AOrf6Ad50rf6,), linearized in the El region by digestion wifli Swa I, resulted in the 
gen«ation of the corresponding Ad24 gag-containing pre-Adoiovirus plasmids 
(pAd24AElgagAE4Ad50rf6, pAd24AElgagAOrf6Ad50rf6) by homologous recombination. 
Potential clones were screened by restriction analysis. 

IQ A similar strategy was used to genacate Ad24 pre-Ad plasmids containing a SEAP 

expression cassette. In this case a SEAP expression cassette consisting of: 1) the immediate 
early gene promoter from the human cytomegalovirus, 2) Ae coding sequraice of the human 
placental SEAP gene, and 3) the bovine growth hormone polyadenylation signal sequence was 
cloned into the El deletion in Adl7 shuttle plasmid, pABSAdl7-3, generating 

1 5 pABSAdl 7HCMVSEAPBGH. The SEAP expression cassette was obtained from a previously 
constructed shuttle plasmid by EcoRI digestion. Following the digestion the desired fiagment 
was gel purified, treated with Klenow to obtain blunt ends and cloned into the Swal site in 
pABSAdl7-3. The shuttle vector containing the SEAP transgene was digested to generate a 
DNA fragmait consisting of the SEAP ejqjression cassette flanked by Adl7 bp 4 to 414 and bp 

20 3373 to 4580 and tiie fragmrat was purified after electrophoresis on an agarose gel. 

Cotransformation of BJ 5 183 bacteria with the shuttle vector fiagment and one of tiie Ad24 pre- 
Ad plasmids (pAd24AElAE4Ad50rf6, pAd24AElAOrf6Ad50rf5,), Unearized in the El region 
by digestion with Swa I, resulted in Hxe generation of the corresponding Ad24 SEAP-containing 
pre-Adenovirus plasmids (pAd24AElSEAPAE4Ad50rf6, pAd24AElSEAPAOrf6Ad50rfi5) by 

25 homologous recombination. Potential clones were screened by restriction analysis. All pre-Ad 
plasmids were rescued into virus and expanded to prepare CsCl purified stocks as described 
above. 

EXAMPLE 13 
30 /v« Vfvn Tmnmnogenicitv 

/\ iTninnniTatinn 

Cohorts of 3-6 animak were given intramuscular injections at wk 0 and wk 4 of 
ei4er of the foUowing constructs: (1) 10^1 1 vp MRKAd5-HIVl gag; (2) 10^10 vp MRKAd5. 
35 HIVl gag; (3) 10^1 1 vp of Ad24AElgagAOrf6Ad50rf6; (4) 10^10 vp of 
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A(i24AElgagAOrf6A<i50rf6; or (5) 10^10 vp of Ad24AElgagAE4Ad5Qrf6. Rhesus macaques 
were between 3-10 kg in weight In all cases, the total dose of each vaccine was suspended in 1 
mL of buffer. The macaques were anesthetized (ketamine/xylazine) and the vaccines wore 
delivered i.m. in 0.5-mL aliquots into both deUoid muscles using tuberculin syringes (Becton- 
5 Dickinson, Frankhn Lakes, NJ). Peripheral blood mononuclear cells (PBMC) were prepared 
from blood samples collected at several time points (typically 4 wk intervals) during the 
umnunization regimen. All animal care and treatment were in accordance with standards 
approved by the Institutional Animal Care and Use Committee according to the principles set 
forth in the Guide for Care and Use of Laboratory Animals^ Institute of Laboratory Ammal 
1 0 Resources, National Research Council. 

B. ELISPOT Assay 

The EFN-y ELISPOT assays for rhesus macaques were condticted following a 
previously described protocol (Allen et al., 2001 J. Tirol 75(2);738-749; Casimiro et al., 2002 /. 

1 5 Virol 76: 1 85-94), with some modifications. For antigen-specific stimulation, a peptide pool was 
prepared from 20-aa peptides that encompass the entire HIV-1 gag sequence with 10-aa overlaps 
(Synpep Corp., Dublin, CA). To each well, 50 |xL of 2-4 x 10^ peripheral blood mononuclear 
cells (PBMCs) were added; the cells were counted usmg Beckman Coulter Z2 particle analyzer 
witii a lower size cut-ofFset at 80 femtoUters ("fL"). Either 50 pL of media or the gag peptide 

20 pool at 8 jig/mL concentration per peptide was added to flie PBMC. The samples were 

incubated at 37**C, 5% CO2 for 20-24 hrs. Spots were developed accordingly and the plates were 
processed using custom-built imager and automatic counting subroutine based on the ImagePro 
platform (Silver Spring, MD); the counts were normalized to 10^ cell input 

25 C. Intracellular Cvtokine Staining 

To 1 ml of 2 X 10^ PBMC/mL in complete RPMI media (m 17x1 00mm round 
bottom polypropylene tubes (Sarstedt, Newton, NC)), anti-hCD28 (clone L293, Becton- 
Dickmson) and anti-hCD49d (clone L25, Becton-Dickinson) monoclonal antibodies were added 
to a final concentration of 1 jig/mL. For gag-specific stimulation, 10 jiL of the peptide pool (at 

30 0.4 mg/mL per peptide) were added. The tubes were incubated at 37 ^C for 1 hr., after which 20 
pL of 5 mg/mL of brefeldin A (Sigma) were added. The cells were incubated for 16 hr at 37 °C, 
5% CO2, 90% humidity. 4 mL cold PBS/2%FBS were added to each tube and the cells were 
pelleted for 10 min at 1200 ipm. The cells were re-suspended in PBS/2%FBS and stained (30 
min, 4 ""C) for surface markers using several fluorescent-tagged mAbs: 20 [iL per tube anti- 

35 hCD3-APC, clone FN-18 (Biosource); 20 ^iL anti-hCD8-PerCP, clone SKl (Becton Dickinson); 
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aad 20 anti-hCD4-PE, clone SK3 (Becton Dickinson). Sample handling from this stage was 
conducted in the dark. The cells weare washed and incubated in 750 pL IxFACS Penn buffer 
(Becton Dickinson) for 10 min at room temperature. The cells were pelleted and re-suspended in 
PBS/2%FBS and 0.1 ng of FTTC-anti-hlFN-y, clone MD-1 (Biosource) was added. After 30 min 
5 incubation, Ihe cells were washed and re-suspended in PBS. Samples, were analyzed using all 
four color channels of the Becton Dickinson FACSCalibur instrument. To analyze 4e data, the 
low side- and forward-scatter lymphocyte population was initially gated; a common fluorescence 
cut-off for cytokine-positive events was used for botii CaM"^ and CD8* populations, and for both 
mock and gag-peptide reaction tubes of a sample. 

10 ' 

D. Anti-D24 ELISA 

A modified competitive anti-p24 assay was developed using reagents from the 
Coulter p24 Antigen Assay kit (Beckman Coulter, FullCTton, CA). Briefly, to a 250-vL serum 
sample, 20 nL of Lyse Buffer and 15 pL of p24 antigen (9.375 pg) from the Coulter kit were • 

15 added. After mixing, 200 jiL of each sample were added to wells coated wiA a mouse anti-p24 
mAb from the Coulter kit and incubated for 1 .5 hr at 37''C. The wells were then washed and 200 
\iL of Biotin Reagent (polyclonal anti-p24-biotin) from the Coulter kit was added to each well. 
After a 1 hr, 37°C incubation, detection was achieved using strepavidia-conjugated horseradish 
peroxidase and TIVIB substrate as described in llie Coulter Kit OD450nm values were recorded. 

20 A 7-point standard curve was genoated usmg a serial 2-fold dilution of serum from an HIV- 
seropositive individual. The lower cut-off for the assay is arbitrarily set at 10 milli Merck 
units/mL (mMU/mL) defined by a dilution of the seropositive human serum. This cutoff Ms at 
approximately 65% of the m^yi""™ bound control signal which corresponds to that obtained 
with the dilusat control only and with no positive analyte. 

25 

E. Results 

PBMCs collected at regular 4-wk intervals were analyzed in an ELISPOT assay 
(Figure 17). Both Ad24AElgagAOrf5Ad50rf6 and Ad24AElgagAE4Ad50rf6 were able to 
induce significant levels of gag-specific T cells in non-human primates. At lO'^l 1 vp dose level, 
30 fee Ad24-induced req)onses were witiiin 2-3-fold of those of MRKAd5-HIVl gag. Both Ad24 
vectors were also able to induce detectable levels of gag-specific T cells at 10^10 vp but were 
lower than fliose observed using MRKadSgag at fee same dose. 

PBMCs collected at wk 12 from fee vaccinees were analyzed for mtracellular 
IFN-y staining after the primmg immunizations. The assay results provided information on fee 
3 5 relative amounts of CD4* and CD8* gag-specific T cells m fee peripheral blood (Figure 18). The 
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results indicated that the prime-boost imnmnization approach was able to elicit in rhesus 
macaques both HIV-specific CD4'' and CDS'" T ceUs. 

V TTnTTinranniTTi ime Responses 
5 The Ad24-based vaccine vector was able to generate detectable levels of 

circulating anti-gag antibodies at the reasonably high dose level (Figure 19). No detectable titers 
were observed at equal to or lower than 10^10 vp, suggesting the existence of a dose-dependent 
response. 

10 EXAMPLE 14 

In Vivo Transpene Expression 

A TTnmimization 

Cohorts of 5 C3H/HeN mice were given single intramuscular injections of one of 
15 the followiag vectors: (1) 10^10 vp Ad24AElSEAPAB4Ad5Qrf6; (2) KTIO vp 

Ad24AElSEAPAOrf6Ad50rf6; (3) 10^10 vp MRKAdSSEAP; and (4) 10^9 vp MRKAdSSEAP. 
Female mice were between 4-10 weeks old. The total dose of each vaccine was suspended in 0.1 
mL of buffer. The vectors were givm to both quadriceps of each of the animals with a volume 
of 50 uL per quad and using 0.3-mL 28G1/2 insulin syringes (Becton-Dickinson, Franklin Lakes, 
20 NJ). For the primates, the total dose of each vaccine was suspended in 1 mL of buflFer. The 

monkeys were anesthetized (ketamine/xylazine mixture) and the vaccines were delivered i.m. in 
0.5-niL aliquots into two muscle sites using tuberculin syringes (Becton-Dickinson, Franklin 
Lakes, NJ). Serum samples were collected at defined intervals and stored frozen until the assay 
date. All animal care and treatment were in accordance with standards approved by the 
25 Institutional Animal Care and Use Committee according to the principles set forth in the Guide 
for Care and Use of Laboratory Animals, Institute of Laboratory Animal Resources, National 
Research Council, 

SEAP Assay 

30 Serum samples were analyzed for circulating SEAP levels using TROPDC . 

phospha-light chemiluminescent kit (Applied Biosystems Inc). Duplicate 5 uL aliquots of each 
serum were mixed with 45 uL of kit-supplied dilution buffer in a 96-well white DYNEX plate. 
Serially diluted solutions of a human placental alkaline phosphatase (Catalog no, M5905, Sigma, 
St Louis, MO) in 10% naive monkey serum served to provide the standard curve. Endogenous 

35 SEAP activity in the samples was inactivated by heating tiie wells for 30 minutes at 65 ""C. 
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Enzymatic SEAP activities in the samples were detennined following the procedures described 
in the kit Chemiluminescence readings (in relative Ught imits) were recorder using DYNEX 
luminometer. RLU readings are converted to ng/mL SEAP using a log-log regression analyses. 

f 

5 C. Rodent Results 

Serum sanq)les prior to and after the injection were analyzed for circulating SEAP , 
activities and the results are shown in Figure 20. Results indicate tiiat (1) both Ad24 constiiicts 
are all capable of expressing the SEAP faransgene in vivo to comparable levels; and that (2) the 
level of expression achieved using the Ad24 vectors are comparable to that of Ad5 at 10-fold 

10 lower dose. Hie levels of SEAP in the serum dropped dramatically after day 2 and were at 
background levels by day 1 2. 

D. Primate Results 

Cohorts of 3 rhesus macaques were gjvffli single intramuscxilar injections of one 

of the following vectors: (1) 10^1 1 vp MRKAd5-SEAP; (2) 10^ vp MRKAd5-SEAP; (3) 10^1 1 
15 vp Ad24AElSEAPAOrf6Ad50rf6; or (4) lO'^l 1 vp Ad24AElSEAPAE4Ad50rfl5. Serum 
samples prior to and after the injection were analyzed for circulating SEAP activities and the 

results are shown in Figure 2 1 . 

Results indicate that the peak levels of SEAP product produced by adenovirus 
serotype 24 were lower tiian but were within 3-fold of that of MRKAdS at tiie same high dose 
20 level of lO'^l 1 vp (Figure 21). The levels observed with adenovirus serotype 24 are generally 
50-fold higher than diose observed using 10^9 vp of MRKAd5 . The levels of SEAP in the 
serum dropped dramaticaUy after day 10 and were close to background as early as day 15. These 
observations strongly indicate that adenovirus serotype 24 is very efficient in expressing a 
transgene following intramviscular administration in a primate. 

25 

EXAMPLE 15 

Cnnstruction of pMRKAd24AEl AE4Ad50rf'6 

To construct pMRKAd24AElAE4Ad50rf5 (An Ad24 pre-Ad plasmid, composed 

entirely of Ad24lequaice and containing an El deletion and an E4 deletion substituted with 
30 Ad5 Orf5), an Ad24 ITR cassette was constiucted containing sequences from the right (bp 3 1 978 
to 32264 and bp 34713 to 35164) and left (bp 4 to 450 and bp 3364 to 3799) end of the Ad24 
genome separated by plasmid sequences containing a bacterial origin of replication and an 
ampicillin resistance gene. These four segments were generated by PGR and cloned sequentially 
into pNEB 193, generating pNEBAd24-4. Next the Ad5 OrfIS open reading frame (Ad5 bp 3 1 192 
35 to bp 34078) was generated by PGR and cloned between Ad24 bp 32264 and 347 1 3 generating 
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pNEBAd24E-Ad5C)rfB (flie ITR cassette). PNEB193 is a commonly used commercially 
available cloning plasmid (New Englaod Biolabs cat# N305 1 S) containing a bacterial origin of 
replication, ampicillin resistance gaie and a multiple cloning site into v^diich the PGR products 
were introduced. The ITR cassette contains a deletion of El sequences from Ad24 bp 45 1 to 

5 3363 wiHi a unique Swa I restriction site located in the deletion and an E4 deletion from Ad24 bp 
32265 to 34712 into which Ad5 OrfiS was introduced in an E4 parallel orieatation. In this 
construct Ad5 Orf6 expression is driven by the Ad24 E4 promoter. The Ad24 sequences (bp 
3 1978 to 32264 and bp 3464 to 3799) in the ITR cassette provide regions of homology with flie 
purified Ad24 viral DNA in which bacterial recombination can occur following cotransfdrmation 

10 into B J 5 1 83 bacteria (Figure 22). The ITR cassette was also designed to contain unique 

restriction enzyme sites (Pmel) located at the aid of the viral TTR's so that digestion will rdease 
flie recombinant Ad24 genome from plasnrid sequences. Potential clones will be screeaied by 
restriction analysis and one clone was selected as pMRKAd24AElAE4Ad50rf6. Pre- 
Adenovirus plasmid pMRKAd24AE 1 AE4Ad50rf6 should contain Ad24 sequences from bp 4 to 

15 450; bp 3364 to bp 32264 and bp 34713 to bp 35164 with Ad5Qrf6 cloned between bp 32264 
and bp 34713. The bp numbering in the above description refers to the wt sequence for both 
Ad24andAd5. 

EXAMPLE 16 

20 TnsertioD of HIV- 1 sag and SEA? trans^enes into p Ad24AEl AE4Ad50rfl5 

In order to introduce a gag or SEAP expression cassettes into the El region of 
pMRKAd24AElAE4Ad50rf6, bacterial recombination will be used. An HTV-l gag expression 
cassette will consist of the following: 1) the immediate early gene promoter from the human 
cytomegalovirus, 2) the coding sequence of the human immunodeficiency virus type 1 pUV-l) 

25 gag (strain CAM-1 ; 1526 bp) gene, and 3) the bovine growth hormone polyadenylation signal 
sequence, in the El deletion of an Ad24 shutde plasmid, pNEBAd24-2 (a precursor to the Ad24 
ITR cassette described above), generating pNEBAd24CMVgagBGHpA. PNEB Ad24-2 contains 
Ad24 sequeaices from the left end of the genome (bp 4 to 450 and bp 3364 to 3799) that define 
the El deletion. The gag expression cassette will be obtained from a previously constructed 

30 plasmid and cloned into the El deletion between bp 450 and 3364 in the El parallel orientation. 
Tlie shuttle vector containing the gag transgene will be digested to generate a DNA firagment 
consisting of the gag expression cassette flanked by Ad24 bp 4 to 450 and bp 3364 to 3799 and 
the fragment will be purified after electrophoresis on an agarose gel. Cotransformation of BJ 
5183 bacteria with the shuttle vector fragmeait and pMRKAd24AElAE4Ad50rf6 which was 

35 linearized in the El region by digestion with Swal, should result m the generation of Ad24 gag- 

-34- 



wo 2004/018627 PCTA)S2003/026145 



10 



containing pre-Adenovirus plasmids pMRKAd24AElgagAE4Ad50rfS by homologous 
lecombination. Potraitial clones will be screened by restriction analysis. 

A similar strategy will be used to generate Ad24 pre-Ad plasmids containing a 
SEAP ejq)ression cassette. In this case, a SEAP e3q)ressi(m cassette will consist of: 1) the 
immediate early gene promoter from the human cytomegalovirus, 2) the coding sequence of the 
human placental SEAP gene, and 3) the bovine growth hormone polyadenylation signal 
sequence cloned into the El deletion of an Ad24 shuttle plasmid, pNEBAd24-2, generating 
pNEBAd24CMVSEAPBGHpA. The transgene will then be recombined into 
pMRKAd24AE 1 AE4Ad50rf6 as described above for the gag transgene. 

EXAMPLE 17 

In Vivo Immuno^enicitv 

A Tmim inization 

15 Rhesus macaques were between 3-10 1^ in weight In all cases, the total dose of 

each vaccme was suspaided in 1 mL of buffer. The macaques w«e anesthetized 
(ketamme/xylazine) and tiie vaccines were delivered i.m. in 0.5-mL aliquots into both deltoid 
muscles using tuberculin syringes (Becton-Dickinson, Franklin Lakes, NJ). Peripheral blood 
mononuclear cells (PBMC) were prepared from blood sanq)les coUected at several time points 

20 during the immunization regimai. All animal care and treatment were in accordance with 
standards approved by the Institutional Animal Care and Use Committee according to the 
principles set forth in the Guide for Care and Use of Laboratory Animals, Institute of Laboratory 
Animal Resources, National Research Council. 

25 R. T Cell Responses 

Ad24 Vaccine Vector as a Heterologous Booster Cohort of 4 rhesus macaques 

was immunized initially with 3 doses (wk 0, 4, 26) of either lO' or lO' vp of MRKAd5-gag {see, 
PCT/USOl/28861, published March 21, 2002) orMRKAd6-gag. At wk 56, the animals received 
a booster vaccine of lO" vp Ad24AElgagAOrf6Ad5C)rf5. A separate cohort of naive animals 

30 received a single dose of the booster vaccine. The results of flie EFN-y ELISPOT analyses of 
PBMC collected during the course of the studies are shown in Figure 23. It is apparent that the 
Ad24 HIV vectors can be utilized to amplify the existing pools of HIV-specific T cells. The 
increases in the levels of gag-specific T cells from the pre-boost levels to those measured at 4 
wks post boost were consistently larger than the levels induced by the same booster vaccine in 

35 naive ammals. PBMCs from the vaccinees of the heterologous MRKAd5/MRXAd6-Ad24 boost 
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regimen were analyzed for mtracellular IFN-y staining after the priming immunizations (wk 60). 
The assay results provided information on flie relative amounts of CIM"^ and CDS'*" gag-specific 
T cells in the peripheral blood (Figure 24). The results indicated that heterologous prime-boost 
immunization approach was able to elicit in rhesus macaques both HTV-specific CD4+ and 
5 CD8+T cells. 

Ad24 Vaccine Vector as a Heterologous Primen In a separate study, a cohort of 3 
rhesus macaques was immunized initially with 2 doses (wk 0, 4) of 10 vp 
Ad24AElgagAOrf5Ad50rf6 and boosted at wk 24 with 10^ vp of MRKAd5-gag. The low dose 
of MRKAd5-gag is selected to mimic the effect of pre-existing neutralizmg immunity to the 

10 vector in a subject A separate cohort ofnaive animals was given a single dose of 10 vp 
MRKAd5-gag. The results of the IFN-y ELISPOT analyses of PBMC collected during the 
course of the studies are shown in Figure 25. 

The Ad24-based vaccine was able to prime effectively for HIV-specific T cell 
responses in macaques. Boosting with a low dose MRKAd5-gag resulted in a significant 

1 5 increase in the levels of gag-specific T cells. The increases in 2 out of 3 animals exceed the 
levels typically observed afl^ treatment ofnaive animals with the same low dose of MMCAd5- 

gag. 

EXAMPLE 18 

20 Construction of DAd34AElAE4Ad50rf6 

To generate an El- Ad34 based vector that can propagate in existing group C/Ad5 
El complementing ceU lines (293, PER.C6), Ad5 OrfS was inserted in place of the native E4 
region. Sinceatthetime,thecompletesequenceof Ad34(5ec Figures 28A-1 to 28A-9; subject 
of copending application serial no. 60/458,825, filed March 28, 2003) was unknown, advantage 

25 was taken of the sequence homology between Ad34 and Ad35 in order to construct the Ad34 
pre- Adenovirus plasmid. Cotransformation of BJ 5 183 bacteria with purified wild-type Ad34 
viral DNA and the appropriately constructed Ad35 ITR cassette resxilted in the curcularization of 
the viral genome by homologous recombination. The construction of the pre- Ad plasmid based 
on Ad34, is outlined below: 

3Q To construct pAd34AEl AE4Ad50rfi5 (An Ad34 pre-Ad plasmid containing an El 

deletion and an E4 deletion substituted with Ad5 C)rf5), we utilized an Ad35 ITR cassette. We 
anticipated that sequence homology between Ad34 and Ad35 would allow homologous 
recombination to occur. The Ad35 ITR cassette was constructed containing sequences from the 
right (bp 31599 to 31913 and bp 34419 to 34793) and left (bp 4 to 456 and bp 3403 to 3886) end 

35 of the Ad35 genome {see Figures 2A-1 to 2A-10) separated by plasmid sequences containing a 
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bacterial origin of replication and an anq)iciilin resistance gene. The four segments were 
generated by PGR and cloned sequentially into pNEB 1 93, generating pNEB Ad354. Next the 
Ad5 Orf5 open reading frame was generated by PGR and cloned between Ad35 bp 3 1913 and 
34419 generating pNEBAd35-4Ad50rfiS (the ITR cassette). PNEB193 is a commonly used 

5 commercially available cloning plasmid (New England Biolabs cat# N305 1 S) containing a 
bacterial origin of repUcation, ampidllin resistance gene and a multiple cloning site mto which 
flie PGR products were intixxhiced. The ITR cassette contains a deletion of El sequences from 
Ad35 bp 457 to 3402 with a vmique Swa I restriction site located in the deletion and an E4 
deletion from Ad35 bp 3 1914 to 34418 into which Ad5 Orf5 was introduced in an E4 parallel 

10 orientation. In tiiis constiuct AdSOrfiS expression is driven by the Ad35 E4 promoter. The Ad35 
sequences (bp 31599 to 3 1913 and bp 3403 to 3886) in the ITR cassette provided regions of 
homology witii the purified Ad34 viral DNA in which bacterial recombination could occur 
foUowing cotransformation into BJ 5183 bacteria (Figure 26). The ITR cassette was also 
designed to contain unique restriction enzyme sites (Pmel) located at the end of the vfral ITR's 

15 so that digestion would release the recombinant Ad34 genome from lie plasmid sequences. 
Potential clones were screened by restriction analysis and one clone was selected as 
pAd34AE 1 AE4Ad5C)rf6. 

EXAMPLE 19 

20 Rescue of pAH:^4AElAE4A d5Qrf6 into Virus 

In order to detamine if pre-adenovirus plasmid pAd34AElAE4Ad50rf5, could be 
rescued into virus and propagated in a groiq) C El con^lanraiting cell line, tixe plasmid was 
digested with Pme I and transfected into T-25 flasks of PERC6 ceUs using tiie calcium 
phosphate co-precipitation technique (GeU Phect Transfection Kit, Amersham Pharmacia 

25 Biotech Inc). Pmd digestion releases the viral genome from plasmid sequences allowing viral 
repHcation to occur after ceU entty. Viral cytopathic effect (CPE), indicating that virus 
replication and amplification was occurring was observed following transfection. When CPE 
was complete, approximately 7-10 days post transfection, tihe infected cells and media wore 
harvested, freeze/thawed three times and the cell debris pelleted by centrifugation. 

30 Approximately 1 ml of the cell lysate was used to infect a T-225 flask of PERC6 cells at 80- 
90% confluence. Once CPE was reached, infected cells and media were harvested, 
freeze/thawed three times and die cell debris pelleted by centiifiigation. Clarified cell lysates 
were then used to infect 2-layer NUNC cell fectories of PER.G6 cells. FoUowing complete CPE, 
tiie vims was purified by ultiBcentiifiigation on GsCl density gradients. In order to verify the 

35 genetic structure of tiie rescued viruses, viral DNA was extracted using pronase treatment 
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followed by phenol chlorofonn extraction and ethanol precipitation. Viral DNA was then 
digested with iffndin and treated with Klenow fragment to end-label the restriction fragments 
with P33-dATP. Hie end-labeled restriction fragments were then size-fractionated by gel 
electrophoresis and visualized by autoradiography. The digestion products were con5)ared with 
5 the digestion products of the corresponding pre-Adenovirus plasmid (that had been digested with 
Pmel/Hindni prior to labelhig) from which they were derived The expected sizes were 
observed, indicating that the vmises had been successfully rescued. 

EXAMPLE 20 

10 Tngerrinn of an Expression Cassette into pAd34A ElAE4AdSOrf6 

In order to introduce a gag or SEAP expression cassette (see Figures 6 and 7, 
respectively) into tiie El region of pAd34AEl AE4Ad50rfl5, bacterial recombination was again 
used. A gag expression cassette consisting of the following: 1) the immediate early gene 
promoter from human cytomegalovirus, 2) the coding sequence of flie human immunodeficiency 

15 virus type 1 (HIV-1) gag (strain CAM-1; 1526 bp) gene, and 3) the bovine growth hormone 
polyadenylation signal sequence, was cloned mto flie El deletion in Ad35 shuttle plasmid, 
pNEBAd35-2 (a precursor to the Ad35 ITR cassettes described above), generating 
pNEBAd35CMVgagBGHpA. pNEBAd35-2 contains Ad35 sequences from Ihe left end of the 
genome (bp 4 to 456 and bp 3403 to 3886) with a unique Swal site between bp 456 and 3403 at 

20 the position of the deletion. The gag expression cassette was obtained from a previously 

constructed shuttle plasmid by EcoRI digestion. Following the digestion the desired fragment 
was gel purified, treated with Klenow to obtain blunt ends and cloned into the Swal site in 
pNEBAd35-2. This cloning step resulted in the gag expression cassette being inserted into the 
El deletion between bp 456 and 3403 in the El parallel orientation. The shuttle vector 

25 containmg the gag transgene was digested to generate a DNA fragment consisting of the gag 
expression cassette flanked by Ad35 bp 4 to 456 and bp 3403 to 3886 and tiie fragment was 
purified after electrophoresis on an agarose gel. Cotransformation of BJ 5 183 bacteria with the 
shutfle vector Augment and pAd34ABlAE4Ad50rf6, linearized in the El region by digestion 
wifli Swa I, resulted in the generation of tiie Ad34 gag-containing pre-Adenovirus plasmid 

30 pAd34AE 1 gagAB4Ad50rf5 by homologous recombination. Potential clones were screened by 
restriction analysis. 

A similar strategy was used to generate Ad34 pre-Ad plasmids containmg a SEAP 
expression cassette. In this case a SEAP expression cassette consisting of: 1) the immediate 
early gene promoter from human cytomegalovirus, 2) the coding sequence of the human 
35 placental SEAP gene, and 3) the bovine growth hormone polyadenylation signal sequence was 
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cloned into the El deletion in Ad35 shuttle plasmid, pNEBAd35-2, generating 
pNEBAdSSCMVSEAPBGHpA. The SEAP expression cassette was obtained from a previously 
constructed shuttle plasmid by EcoBl digestion. FoUowing the digestion the desired fragment 
was gel purified, treated with Klenow to obtain blunt ends and cloned into the Swal site in 
5 pNEBAd35-2. The transgene was then recombined into the pAd34AElAE4Ad50rf5, generating 

pAd34AElSEAPAE4Ad50rfB as described above for the gag transgene. 

All pre-Ad plasmids were rescued into virus and eqanded to prepare CsCl 

purified stocks as described above. 

10 EXAMPLE21 

rnnstriiction of pMRKAd34A P.1 AR4Ad50rf6 

To construct an Ad34 pre-Ad plasmid that was composed entirely of Ad34 
sequences, an Ad34 TTR cassette was generated. Hie Ad34 ITR cassette was constructed 
containing sequences from fbe right (bp 3 1584 to 3 1895 and bp 34409 to 34772) and left (bp 4 to 

15 456 and bp 3402 to 3885) end of the Ad34 genome (see Figures 28A.1 to 28A-9) separated by 
plasmid sequences containing a bacterial origin of replication and an ampicillin resistance gene. 
These four segments were generated by PGR and cloned sequentially into pNEB193, generating 
pNEBAd34-4. Next the Ad5 OrfS open reading frame was generated by PGR and cloned 
between Ad34 bp 3 1895 and 34409 generating pNEBAd34-4Ad50rf6 (the ITR cassette). 

20 PNEB193 is a commonly used commercially available cloning plasmid (New England Biolabs 
cat# N305 1 S) containing a bacterial origin of replication, ampicillin resistance gene and a 
multiple clomng site into which the PGR products were mtroduced. The ITR cassette contains a 
deletion of El sequences from Ad34 bp 457 to 3401 with a unique Swa I restriction site located 
in Ihe deletion and an E4 deletion from Ad34 bp 3 1 896 to 34408 into which Ad5 Orf6 was 

25 inttoduced in an E4 parallel orientation. In this consfructAd50rf6 expression is driven by the 

Ad34 E4 promoter. The Ad34 sequences (bp 31584 to 31895 and bp 3402 to 3885) in the ITR 
cassette provided regions of homology with the purified Ad34 viral DNA in which bacterial 
recombination could occur following cotransformation into BJ 5183 bacteria (Figure 27). The 
ITR cassette was also designed to contain unique restriction enzyme sites (Pmel) located at the 
30 end of the viral ITR's so that digestion would release Ihe reconibinant Ad34 genome from the 
plasmid sequences. Potential clones were screened by resttiction analysis and one clone was 
selected as pMRKAd34AEl AE4Ad50rf5. 
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EXAMPLE 22 . 
In Vivo Studies 

A. Tmrnunization 

5 Cohorts of 3 rhesus macaques were given single intramuscular injections of one 

of the two vectors: (1) KK'l 1 vp MRKAd5-SEAP (in MRKAd vector backbone disclosed in 
PCTAJSOl/28861, published March 21, 2002); and (2) 10^11 vp Ad34AElSEAPAE4Ad50rf6. 
Rhesus macaques were between 3-10 kg in weight In all cases, the total dose of each vaccine 
was suspended in 1 mL of buffer. The macaques were anesthetized (ketamine/xylazine) and the 

10 vaccines were delivered i.m. in 0.5-niL aliquots mto both deltoid muscles using tuberculin 

syringes (Becton-Dickinson, Franklin Lakes, NJ). Peripheral blood mononuclear cells (PBMC) 
w^e prepared from blood samples collected at several time points during the immunization 
regimen. All animal care and treatment were in accordance with standards approved by the 
Institutional Animal Care and Use Committee according to the principles set forth in the Guide 

1 5 for Care and Use of Laboratory Animals^ Institute of Laboratory Animal Resources, National 
Research Council. 

B. SEAPAssav 

Serum samples were analyzed for circulating human secreted alkaline 
20 phosphatase (SBAP) levels using TROPDC phospha-hght chemiluminescent kit (Applied 

Biosystems Inc). Duphcate 5 ^L aliquots of each serum were mixed with 45 fiL of kit-supplied 
dilution buffer in a 96-well white DYNEX plate. Serially diluted solutions of a human placental 
alkaline phosphatase (Catalog no. M5905, Sigma, St Louis, MO) in 10% naive monkey serum 
served to provide the standard curve. Endogenous SEAP activity in the samples was mactivated 
25 by heating the well for 30 minutes at 65 "^C. Enzymatic SEAP activities in the sanqples were 
determined following the procedures described in the kit Chemiluminescence readings (in 
relative light units) were recorded using DYNEX luminometer. RLU readings were converted to 
ng/mL SEAP using a log-log regression analyses. 

30 n. ELISPOTAssav 

The IFN-y ELISPOT assays for rhesus macaques were conducted following a 
previously described protocol (Allen et al, 2001 /. nrol 75(2):738-749), with some 
modifications. For antigen-specific stimulation, a peptide pool was prepared from 20-aa 
peptides that encompass the entire HIV-1 gag sequence with 10-aa overlaps (Syrpep Coip-, 
35 Dublin, CA). To each well, 50 |xL of 2-4 x 10^ peripheral blood mononuclear cells (PBMCs) 
were added; the cells were counted using Beckman Coulter 22 particle analyzer with a Iowct 
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size cut-off set at 80 femtoliters ("fL"). Either 50 ^iL of media or the gag peptide pool at 8 
lig/mL concentration per peptide was added to Ihe PBMC. The samples were incubated at 3TC, 
5% CO2 for 20-24 hrs. Spots were developed accordingly and the plates were processed using 
custom-built imager and automatic countmg subroutine based on the ImagePro platform (SUver 
5 Spring, MD); the counts were normalized to 1 0' cell input. 

D. Intracellular nvtokine Staining aCS) 

To 1 ml of 2 X 10* PEMOmL in complete KPMI media (in 17xl00nun round 
bottom polypropylene tubes (Saretedt. Newton, NC)), anti-hC3D28 (clone L293, Becton- 

1 0 Dickinson) and anti-hCD49d (clone L25, Becton-DickiDSon) monoclonal antibodies were added 
to a final concentration of 1 Hg/imL. For gag-specific stimulation, 10 pL of tbe peptide pool (at 
0.4 mg/mL per peptide) were added. The tubes were incubated at 37 °C for 1 hr., after which 20 
of 5 mg/mL of biefeldin A (Sigma) were added. The ceUs were incubated for 16 hr at 37 "C, 
5% CO2, 90% humidity. 4 mL cold PBS/2%FBS were added to each tube and the ceUs were 

15 pelleted'for 10 min at 1200 rpm. The cells were re-suspended in PBS/2%FBS and stained (30 
min, 4 °C) for surface markers using several fluorescent-tagged mAbs: 20 per tube anti- 
hCD3-APC, clone FN-18 (Biosource); 20 jxL anti-hCD8-PerCP, clone SKI (Becton Dickinson); 
and 20 \jlL anti-hCD4-PE, clone SK3 (Becton Dickinson). Sanq)le handling from this stage was 
conducted in the dark. The cells were washed and incubated in 750 itL IxFACS Perm buffer 

20 (Becton Dickinson) for 10 min at room temperature. The ceUs were pelleted and re-suspended in 
PBS/2%FBS and 0.1 ^g of FITC-anti-hlFN-y, clone MD-1 (Biosource) was added. After 30 min 
incubation, the cells were washed and re-suspended in PBS. Samples were analyzed using aU 
four color channels of the Becton Dickinson FACSCalibur instrument. To analyze the data, the 
low side- and forward-scatter lymphocyte population was initially gated; a common fluorescence 

25 cut-off for cytokine-positive events was used for both CD4* and CD8* populations, and for both 
mock and gag-peptide reaction tubes of a sample. 

E. Results 

Expression: Serum samples prior to and after the injection were analyzed for 
30 circulating SEAP activities and the results are shown in Figure 29. Results indicate that flie 
peak levels of SEAP protein produced by the alternative adenovirus serotype were lower than 
but were within 3-fold of that of MRKAdS at the same high dose level of 10^1 1 vp (Figure 29). 
The levels of SEAP in the serum dropped dramatically after day 10 and were close to 
background as early as day 15. These observations strongly indicate that the Ad34-based vector 
35 is efScient in expressing a transgene following intramuscular administration in a primate. 

-41- 



wo 2004/018627 



PCTAJS2003/026145 



Tmrnimo penicitv! V accine-induced T cell responses against HIV-1 gag were 
quantified using IFN-gamma ELISPOT assay against a pool of 20-aa peptides that encompassed 
die entire protein sequence. The results arc shown in Figure 30; &ey are e3q)ressed as the 
number of spot-forming cells (SFC) per million peripheral blood mononuclear cells (PBMCs) 
5 that responded to the peptide pool or the mock (no peptide) control 

Immxmization witti gag-expressing Ad34 vector induced detectable levels of 
circulating gag-specific T cells immediately after a single dose of the vector. The responses 
improved following a second dose given at wk 4. Overall, the responses to the Ad34-based 
vector w^e slightly lower than those induced by the same dose of MRKAd5-gag. The results 
10 strongly indicate die Ad34-based vector can prime effectively for HTV-specific T cell responses. 

IFN-Y ICS analyses of the PBMC from the Ad34-immunized animals revealed 
that the vector can induce detectable levels of both CD4* and CDS'** HIV-specific T cells (Figure 
31). 

15 EXAMPLE 23 

Heterologous Immunization 

Cohorts of 3 monkeys were immunized (at wks 0, 4) with lO'^l 1 vp 

Ad34AElgagAE4Ad50rf5 followed by a booster at week 24 wifli ICK'IO vp 

Ad35AElgagAE4Ad50rf6. Vaccine-induced T cell responses against HIV-1 gag were 
20 quantified using BFN-gamma ELISPOT assay agamst a pool of 20-aa peptides that encompassed 

the entire protem sequence. The results are shown in Figure 32; they are expressed as the 

number of spot-forming cells (SFC) per million peripheral blood mononuclear cells (PBMCs) 

that responded to the peptide pool or the mock (no peptide) control. 

Immunization with gag-e3q)ressing Ad34 vector induced detectable levels of 
25 circulating gag-specific T cells that decreased to between 94-139 SFC/10^6 PBMC at the time of 

the boost. Heterologous immunization with an Ad35-based HIV vector resulted in as much as a 

3-fold increase in T cell responses. 

IFN-y ICS analyses of die PBMCs firom the Ad34 primed/Ad35 boosted animals 

at week 28 revealed that the vector can induce detectable levels of both CD4**" and CDS"*" HIV- 
30 specific T cells (Figure 33). 
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WHAT IS CLAIMED IS: 

1 . A means for propagating replication-defective adenovirus in an adenoviral 
El-complementing cell line expressing El gene product(s) which are non-native to the 

5 adenovirus, which comprises: 

(a) inserting all or a portion of a heterologous adenoviral B4 region 
comprising nucleic acid sequence encoding open reading frame 6 (ORF6) into a replication- 
defective adenovirus; wherein the E4 region or portion thereof inserted into the adenovirus is 
native to a virus of the same adenovirus serotype as the El gene product(s) expressed by the 

10 complementing cell line; 

(b) introducing the replication-defective adenovirus into the adenoviral El- 
complementing cell line; 

(c) allowing the replication-defective adenovirus to propagate in the 

adenoviral El-complementing cell line; and 
15 (d) rescuing the propagated adenovirus. 

2. A means in accordance with claim 1 wherem die heterologous adenoviral 
E4 region or portion thereof conq)rises the complete adenoviral E4-encoding region. 

3. A means in accordance with claim 2 wherein the heterologous adenoviral 
E4 region or portion tiiereof comprises the con5)lete adenoviral E4-encoding region and native 

20 E4 promoter. 

4. A means in accordance with claim 1 wherein the heterologous adenoviral 
E4 region or portion thereof is inserted into the replication-defective virus in place of nucleic 
acid sequence encoding open reading frame 6 (0RF6). 
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5. A means in accordance wifli claim 1 wherein the heterologous adenoviral 
E4 region or portion thereof is inserted into the replication-defective virus in place of nucleic 
acid sequence encoding the complete adenoviral E4-encoding region. 

6. A means in accordance with claim 1 wherein the heterologous adenoviral 
E4 region or portion thereof is derived from a subgroup C adenovirus. 

7. A means in accordance with claim 1 wherein the subgroup C adenovmis is 

adenovirus of serotype 5. 

8. A means in accordance with claim 7 wherein tiie replication-defective 

adenovirus is an adenovirus of subgroiq) B. 

9. A means in accordance with claim 7 wherein the replication-defective 

adraovirus is an adenovirus of serotype 35. 

10. A means in accordance with claim 1 wherein the heterologous adenoviral 

E4 region or portion thereof is operatively linked to a heterologous promoter. 

1 1. A means in accordance with claim 1 wherein the adenoviral El- 
complementing cell line is a PER.C6® cell line. 

1 2. A replication-defective adenovirus comprising all or a portion of a 
heterologous E4 region comprising a heterologous adenoviral open reading frame 6 (0RF6). 

13. A replication-defective adenovirus in accordance with claim 12 wh^ein 

the adenovirus comprises a heterologous gene of interest. 

14. A replication-defective adenovirus in accordance with claim 13 wherein 
the heterologous gene of interest is a gene encodmg an HIV-1 antigen. 

1 5. A replication-defective adenovirus in accordance with claim 14 wherein 
the HIV-1 antigen is selected from the groiqp consisting of HTV-l gag, pol, nef and env. 
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16. A replication-defective adenovirus comprising all or a portion of a 
heterologous E4 region comprising a heterologous adenoviral open reading frame 6 (0RF6) and 

a gene encoding HIV-1 gag. 

17. A replication-defective adenovirus comprising all or a portion of a 

5 heterologous E4 region comprising a heterologous adenoviral open reading frame 6 (0RF6) in 
place of a native E4 region or portion thereof comprising ORF6. 

1 8. A replication-defective adenovirus comprising all or a portion of a 
heterologous E4 region comprising a complete heterologous E4 region in place of a complete 
native E4 region. 

10 19. A replication-defective adenovirus comprising a heterologous E4 region 

or portion thereof comprising a complete heterologous E4 region including E4 promoter in place 
of a complete native E4 region. 

20. Adenovirus propagated in accordance with the means of claim 1 . 

21. A means in accordance with claim 1 wherein the replication-defective 
1 5 adenovirus comprises a heterologous gene of interest 

22. A means in accordance with claim 21 wherein the heterologous gene of 

interest is a gene encoding an HIV- 1 antigen. 

23 . A means in accordance with claim 22 wherein the HIV- 1 antigen is 
selected from the group consisting of: HTV- 1 gag, pol, nef and env. 

20 24. A replication-defective adenovirus of serotype 35 comprising all or a 

portion of an adenovirus serotype 5 E4 region comprising open reading frame 6 (0RF6) and a 
heterologous gene of interest. 

25. A replication-defective adenovirus in accordance with claim 24 wherein 

the heterologous gene of interest is a gene encoding an HIV-1 antigen. 
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26. A replication-defective adenovirus in accordance witii claim 25 wherein 
Ihe HTV- 1 antigen is selected from liie group consisting of: HTV-l gag, pol, nef and env. 

27. A replication-defective adenovirus of serotype 35 comprising all or a 
portion of an adenovirus serotype 5 E4 region comprising open reatfing frame 6 (0RF6) and a 

5 gene encoding HIV-1 gag. 

28. A recombinant adenoviral vector of serotype 24 which comprises an E4 
gene or a segmrat of an E4 gene comprising open reading frame 6 ("ORF6") of an alternative 
serotype. 

29. A population of cells comprising the recombinant adenoviral vector of 

10 claim 28. 

30. A method for producing recombinant, replication-defective adenovirus 

particles comprising: 

(a) introducing a recombinant adenoviral vector of claim 28 into a population of 

cells expressing adenovirus El; and 
15 (b) harvesting the resultant recombinant, replication-defective adenovirus. 

3 1 . Purified recombinant, replication-defective adenovirus particles harvested 

in accordance with the method of claim 30. 

32. A conq>osition comprising purified recombinant adenovirus particles in 

accordance with claim 31. 
20 33. A composition in accordance with claim 32 which comprises a 

physiologically acceptable carrier. 

34. A recombinant adenoviral vector in accordance with claim 28 which is at 
least partially deleted in El and devoid of El activity and comprises a heterologous nucleic acid. 
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35. A composition comprising purified recombinant adenoviral particles in 
accordance with claim 31 which are at least partially deleted in El and devoid of El activity and 
comprise a heterologous nucleic acid. 

36. A method for effecting the delivery and expression of heterologous 
nucleic acid comprising administering the composition of claim 35 prior or subsequent to 
administration of the heterologous nucleic acid with the same or different vector. 

37. A metiiod in accordance with claim 36 wherein .tiie composition is 
preceded or followed by administration of heterologous nucleic acid with an adenovirus of a 
different serotype. 

38. A composition in accordance with claim 35 wherein the heterologous 

nucleic acid encodes an HIV antigen. 

39. A method for generating a cellular-mediated immune response against 
HTV in an individual comprising admmistering to the individual a conqwsition of claim 38. 

40. A composition in accordance with claim 39 wherein the HIV antigen is 
HTV-l gag or immunologically relevant modification thereof. 

41 . A composition in accordance with claim 39 wherein the HTV antigen is 
HTV- 1 nef or immunologically relevant modification thereof. 

42. A composition in accordance with claim 39 wherein the HTV antigen is 
HIV-1 pol or immunologically relevant modification thereof. 

43. A recombinant adenoviral vector of serotype 24 which is at least partially 
deleted m El and devoid of El activity; wherein said vector comprises an E4 gene or a segmrat 
of an E4 gene from adenoviras serotype 5 comprising open reading frame 6 ("0RF6"), and a 
heterologous nucleic acid. 
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44. A population of cells comprising the recombinant adenoviral vector of 

claim 43. 

45. A method for producing recombinant, replication-defective adenovirus 
particles comprising: 

5 (a) introducing a recombinant adenoviral vector of claim 43 into a population of 

cells expressing adenovirus serotype 5 El; and 

(b) harvesting the resultant recombinant, replication-defective adenovirus. 

46. Purified recombinant, replication-defective adenovirus particles harvested 

in accordance with the method of claim 45. 
10 47. A coniposition conaprising purified recombinant adenovirus particles in 

accordance with claim 46. 

48. A conaposition in accordance with claim 47 which comprises a 

physiologically acceptable carrier. 

49. A method for effecting the delivay and e^qpression of the heterologous 
1 5 nucleic acid comprising administering the composition of claim 48 prior or subsequent to 

administration of the heterologous nucleic acid with the same or different vector. 

50. A method in accordance with claim 49 above wherein the composition is 
preceded or followed by administration of the heterologous nucleic acid with an adenovirus of a 
different serotype. 

20 51. A con^sition in accordance with claim 48 wherein the heterologous 

nucleic acid encodes an HIV antigen. 

52. A method for generating a cellular-mediated immune response agamst 
HiV in an individual comprising administering to the individual a composition of claim 51. 
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53 . A composition in accordance with claim 5 1 wherein the HIV antigen is 
HTV- 1 gag or inimunologically relevant modification ttiereof. 

54. A composition in accordance with claim 5 1 wherein the HIV antigen is 
HV-l nef or immunologically relevant modification thereof. 

5 5 5. A composition in accordance with claim 5 1 wherein the HIV antigen is 

HIV- 1 pol or immxmologically relevant modification thereof. 

56 A recombinant adenoviral vector of serotype 34 which comprises an E4 
gene or a segment of an E4 gene comprising open reading frame 6 ("ORF6") of an alternative 
serotype. 

10 57. A population of cells comprising the recombinant adenoviral vector of 

claim 56. 

58. A method for producing recombuiant, rephcation-defective adenovhrus 

particles comprising: - - 

(a) introducing a recombinant adenoviral vector of claim 56 mto a population of 

1 5 cells expressing adenovirus El ; and 

(b) harvesting the resultant recombinant, replication-defective adenovirus. 

59. Purified recombinant, rephcation-defective adenovirus particles harvested 

in accordance with the method of claim 58. 

60. A composition comprising purified recombinant adenovirus particles in 

20 accordance with claim 59. 

61 . A composition in accordance with claim 60 which comprises a 

physiologically acceptable carrier. 

62. A recombinant adenoviral vector in accordance with claim 56 which is at 

least partially deleted in El and devoid of El activity and comprises a heterologous nucleic acid. 
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63 . A composition comprising purified recombinant adenoviral particles in 
accordance with claim 59 which are at least partially deleted in El and devoid of El activity and 

comprise a heterologoiis nucleic acid. 

64. A method for eflfecting flie delivery aid ejqpre^ioii of hetefologc^^^ 
5 nucleic acid comprising administering the composition of claim 63 prior or subsequent to 

administration of the heterologous nucleic acid with the same or different vector. 

65 . A method in accordance with claim 64 wherein the composition is 

I 

preceded or followed by administration of het^ologous nucleic acid with an adenovirus of a 
different serotype. 

1 Q 66. A composition in accordance with claim 63 wherein tiie heterologous 

nucleic acid encodes an HTV antigen. 

67. A method for generating a cellular-mediated immune response against 
mV in an individual comprising administering to the individual a con^sition of claim 66. 

68. A composition in accordance with claim 67 wherein the HIV antigen is 
1 5 HIV- 1 gag or immunologically relevant modification thereof. 

69. A coDoposition in accordance with claim 67 wherein tiie HIV antigen is 
HIV-1 nef or immunologically relevant modification thereof. 

70. A composition in accordance with claim 67 wherein the HTV antigen is 
HTV- 1 pol or immunologically relevant modification thereof. 

20 7 1 . A recombinant adenoviral vector of serotype 34 which is at least partially 

deleted in El and devoid of El activity; wherein said vector comprises an E4 gene or a segment 
of an E4 gene from adenovirus serotype 5 comprising open reading fiame 6 C'0RF6")> and a 
heterologous nucleic acid. 

« 
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72. A population of cells comprising the recombinant adenoviral vector of 



claim 71. 



73. A melhod for producing recombinant, rqplication-defective adenovirus 



particles con^rising: 

(a) introducing a recombinant adenoviral vector of claim 71 into a population of 

cells expressing adenovirus serotype 5 El; and 

(b) harvesting the resultant recombinant, replication-defective adenovirus. 

74. Purified recombinant, repUcation-defective adenovirus particles harvested 

in accordance with ihe method of claim 73 . 

75. A composition comprising purified recombinant adenovirus particles in 

accordance with claim 74. 

76. A conq)osition in accordance with claim 75 which comprises a 

physiologically acceptable carriw. 

77. A method for effecting the delivery and exjMBSsion of the heterologous 

nucleic acid comprismg administering the composition of claim 76 prior or subsequent to 
administration of the heterologous nucleic acid with the same or different vector. 

78. A method m accordance with claim 77 above wherein the composition is 
preceded or followed by administration of the heterologous nucleic acid with an adenovhus of a 
different serotype. 

79. A composition in accordance with claim 76 wherein the heterologous 

nucleic acid encodes an HIV antigen. 

80. A mefliod for generating.a cellular-mediated hnmune response against 
HIV in an individual comprismg administering to the individual a composition of claim 79. 
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81. A composition in accordance with claim 79 wherein the HTV antigen is 
HTV-l gag or immunologically relevant modification thereof. 

82. A composition in accordance with claim 79 wherein the HTV antigen is 
HTV-l nef or immunologically relevant modification thereof. 

5 83. A coirrposition in accordance with claim 79 wherein the HIV antigen is 

HTV-l pol or immmiologically relevant modification thCTCof. 
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1 catcatcaat aatatacctt atagatggaa tggtgccaat atgtaaatga ggtgatttta 
61 aaaagtgtgg gccgtgtggt gattggctgt ggggttaacg gttaaaaggg gcggcgcggc 
121 cgtgggaaaa tgacgtttta tgggggtgga gtttttttgc aagttgtcgc gggaaatgtt 
181 acgcataaaa aggcttcttt tctcacggaa ctacttagtt ttcccacggt atttaacagg 
241 aaatgaggta gttttgaccg gatgcaagtg aaaattgctg attttcgcgc gaaaactgaa 
301 tgaggaagtg tttttctgaa taatgtggta tttatggcag ggtggagtat ttgttcaggg 
361 ccaggtagac tttgacccat tacgtggagg tttcgattac cgtgtttttt acctgaattt 
421 ccgcgtaccg tgtcaaagtc ttctgttttt acgtaggtgt cagctgatcg ctagggtatt 
481 tatacctcag ggtttgtgtc aagaggccac tcttgagtgc cagcgagaag agttttctcc 
541 tctgcgccgg cagtttaata ataaaaaaat gagagatttg cgatttctgc ctcaggaaat 
601 aatctctgct gagactggaa atgaaatatt ggagcttgtg gtgcacgccc tgatgggaga 
661 cgatccggag ccacctgtgc agctttttga gcctcctacg cttcaggaac tgtatgattt 
721 agaggtagag ggatcggagg attctaatga ggaagctgtg aatggctttt ttaccgattc 
781 tatgctttta gctgctaatg aaggattaga attagatccg cctttggaca ctttcaatac 
841 tccaggggtg attgtggaaa gcggtacagg tgtaagaaaa ttacctgatt tgagttccgt 
901 ggactgtgat ttgcactgct atgaagacgg gtttcctccg agtgatgagg aggaccatga 
961 aaaggagcag tccatgcaga ctgcagcggg tgagggagtg aaggctgcca atgttggttt 
1021 tcagttggat tgcccggagc ttctggacat ggctgtaagt cttgtgaatt tcacaggaaa 
1081 aatactggag taaaggaact gttatgttcg cttttgttat atgaaaaccc actgccactt 
1141 tatttacagt aaagtgtgtt taagttaaaa tttaaaggaa tatgctgttt ttcacatgta 
1201 tattgagtgt gagttttgtg cttcttatta taagtcctgt gtctgatgct gatgaatcac 
1261 catctcctga ttctactacc tcacctcctg atattcaagc acctgttcct gtggacgtgc 
1321 gcaagcccat tcctgtgaag cttaagcctg ggaaacgtcc agcagtggag aaacttgagg 
1381 acttgttaca gggtggggac ggacctttgg acttgagtac acggaaacgt ccaagacaat 
1441 aagtgttcca tatccgtgtt tacttaaggt gacgtcaata tttgtgtgag agtgcaatgt 
1501 aataaaaata tgttaactgt tcactggttt ttattgcttt ttgggcgggg actcaggtat 
1561 ataagtagaa gcagacctgt gtggttagct cataggagct ggctttcatc catggaggtt 
1621 tgggccattt tggaagacct taggaagact aggcaactgt tagagagcgc ttcggacgga 
1681 gtctccggtt tttggagatt ctggttcgct agtgaattag ctagggtagt ttttaggata 
1741 aaacaggact ataaacaaga atttgaaaag ttgttggtag attgcccagg actttttgaa 
1801 gctcttaatt tgggccatca ggttcacttt aaagaaaaag ttttatcagt tttagacttt 
1861 tcaaccccag gtagaactgc tgctgctgtg gcttttctta cttttatatt agataaatgg 
1921 atcccgcaga ctcatttcag caggggatac gttttggatt tcatagccac agcattgtgg 
1981 agaacatgga aggttcgcaa gatgaggaca atcttaggtt actggccagt gcagcctttg 
2041 ggtgtagcgg gaatcctgag gcatccaccg gtcatgccag cggttctgga ggaggaacag 
2101 caagaggaca acccgagagc cggcctggac cctccagtgg aggaggcgga gtagctgact 
2161 tgtctcctga actgcaacgg gtgcttactg gatctacgtc cactggacgg gataggggcg 
2221 ttaagaggga gagggcatcc agtggtactg atgctagatc tgagttggct ttaagtttaa 
2281 tgagtcgcag acgtcctgaa accatttggt ggcatgaggt tcagaaagag ggaagggatg 
2341 aagtttctgt attgcaggag aaatattcac tggaacaggt gaaaacatgt tggttggagc 
2401 cagaggatga ttgggaggtg gccattaaaa attatgccaa gatagctttg aggcctgata 
2461 aacagtataa gatcagtaga cggattaata tccggaatgc ttgttacata tctggaaatg 
2521 gggctgaggt ggtaatagat actcaagaca agacagttat tagatgctgc atgatggata 
2581 tgtggcctgg agtagtcggt atggaagcag tcacttttgt aaatgttaag tttaggggag 
2"641 atggttataa tggaatagtg tttatggcca ataccaaact tatattgcat ggttgtagct 
2701 tttttggttt caacaatacc tgtgtagatg cctggggaca ggttagtgta cgggggtgta 
2761 gtttctatgc gtgttggatt gccacagctg gcagaaccaa gagtcaattg tctctgaaga 
2821 aatgcatatt ccaaagatgt aacctgggca ttctgaatga aggcgaagca agggtccgtc 
2881 actgcgcttc tacagatact ggatgtttta ttttaattaa gggaaatgcc agcgtaaagc 
2941 ataacatgat ttgtggtgct tccgatgaga ggccttatca aatgctcact tgtgctggtg 
3001 ggcattgtaa tatgctggct actgtgcata ttgtttccca tcaacgcaaa aaatggcctg 
3061 tttttgatca caatgtgttg accaagtgca ccatgcatgc aggtgggcgt agaggaatgt 
3121 ttatgcctta ccagtgtaac atgaatcatg tgaaagtgtt gttggaacca gatgcctttt 
3i81 ccagaatgag cctaacagga atctttgaca tgaacacgca aatctggaag atcctgaggt 
3241 atgatgatac gagatcgagg gtgcgcgcat gcgaatgcgg aggcaagcat gccaggttcc 
3301 agccggtgtg tgtagatgtg accgaagatc tcagaccgga tcatttggtt attgcccgca 
3361 ctggagcaga gttcggatcc agtggagaag aaactgacta aggtgagtat tgggaaaact 
3421 ttggggtggg attttcagat ggacagattg agtaaaaatt tgttttttct gtcttgcagc 
3481 tgacatgagt ggaaatgctt cttttaaggg gggagtcttc agcccttatc tgacagggcg 
3541 tctcccatcc tgggcaggag ttcgtcagaa tgttatggga tctactgtgg atggaagacc 
3601 cgttcaaccc gccaattctt caacgctgac ctatgctact ttaagttctt cacctttgga 
3661 cgcagctgca gccgctgccg ccgcctctgt cgccgctaac actgtgcttg gaatgggtta 
3721 ctatggaagc atcgtggcta attccacttc ctctaataac ccttctacac tgactcagga 



FIG. 2A-1 



wo 2004/018627 PCTAIS2003/026145 



3/59 

3781 caagttactt gtccttttgg cccagctgga ggctttgacc caacgtctgg gtgaactttc 
3841 tcagcaggtg gccgagttgc gagtacaaac tgagtctgct gtcggcacgg caaagtctaa 
3901 ataaaaaaaa ttccagaatc aatgaataaa taaacgagct tgttgttgat ttaaaatcaa 
3961 gtgtttttat ttcatttttc gcgcacggta tgccctggac caccgatctc gatcattgag 
4021 aactcggtgg attttttcca gaatcctata gaggtgggat tgaatgttta gatacatggg 
4081 cattaggccg tctttggggt ggagatagct ccattgaagg gattcatgct ccggggtagt 
4141 gttgtaaatc acccagtcat aacaaggtcg cagtgcatgg tgttgcacaa tatcttttag 
4201 aagtaggctg attgccacag ataagccctt ggtgtaggtg tttacaaacc ggttgagctg 
4261 ggaggggtgc attcgaggtg aaattatgtg cattttggat tggattttta agttggcaat 
4321 attgccgcca agatcccgtc ttgggttcat gttatgaagg actaccaaga cggtgtatcc 
4381 ggtacattta ggaaatttat cgtgcagctt ggatggaaaa gcgtggaaaa atttggagac 
4441 acccttgtgt cctccgagat tttccatgca ctcatccatg ataatagcaa tggggccgtg 
4501 ggcagcggcg cgggcaaaca cgttccgtgg gtctgacaca tcatagttat gttcctgagt 
4561 taaatcatca taagccattt taatgaattt ggggcggagc gtaccagatt ggggtatgaa 
4621 tgttccttcg ggccccggag catagttccc ctcacagatt tgcatttccc aagctttcag 
4681 ttctgagggt ggaatcatgt ccacctgggg ggctatgaag aacaccgttt cgggggcggg 
4741 ggtgattagt tgggatgata gcaagtttct gagcaattga gatttgccac atccggtggg 
4801 gccataaata attccgatta caggttgcag gtggtagttt agggaacggc aactgccgtc 
4861 ttctcgaagc aagggggcca cctcgttcat catttccctt acatgcatat tttcccgcac 
4921 caaatccatt aggaggcgct ctcctcctag tgatagaagt tcttgtagtg aggaaaagtt 
4981 tttcagcggt tttagaccgt cagccatggg cattttggaa agagtttgct gcaaaagttc 
5041 tagtctgttc cacagttcag tgatgtgttc tatggcatct cgatccagca gacctcctcg 
5101 tttcgcgggt ttggacggct cc tggagtag ggtatgagac gatgggcgtc cagcgctgcc 
5161 agggttcggt ccttccaggg tctcagtgtt cgagtcaggg ttgtttccgt cacagtgaag 
5221 gggtgtgcgc ctgcttgggc gcttgccagg gtgcgcttca gactcattct gctggtggag 
5281 aacttctgtc gcttggcgcc ctgtatgtcg gccaagtagc agtttaccat gagttcgtag 
5341 ttgagcgcct cggctgcgtg gcctttggcg cggagcttac ctttggaagt tttcttgcat 
5401 accgggcagt ataggcattt cagcgcatac agcttgggcg caaggaaaat ggattctggg 
5461 gagtatgcat ccgcgccgca ggaggcgcaa acagtttcac attccaccag ccaggttaaa 
5521 tccggttcat tggggtcaaa aacaagtttt ccgccatatt ttttgatgcg tttcttacct 
5581 ttggtctcca taagttcgtg tcctcgttga gtgacaaaca ggctgtccgt atctccgtag 
5641 actgatttta caggcctctt ctccagtgga gtgcctcggt cttcttcgta caggaactct 
5701 gaccactctg atacaaaggc gcgcgtccag gccagcacaa aggaggctat gtgggagggg 
5761 tagcgatcgt tgtcaaccag ggggtccacc ttttccaaag tatgcaaaca catgtcaccc 
5821 tcttcaacat ccaggaatgt gattggcttg taggtgtatt tcacgtgacc tggggtcccc 
5881 gctggggggg tataaaaggg ggcggttctt tgctcttcct cactgtcttc cggatcgctg 
5941 tccaggaacg tcagctgttg gggtaggtat tccctctcga aggcgggcat gacctctgca 
6001 ctcaggttgt cagtttctaa gaacgaggag gatttgatat tgacagtgcc ggttgagatg 
6D61 cctttcatga ggttttcgtc catttggtca gaaaacacaa tttttttatt gtcaagtttg 
6121 gtggcaaatg atccatacag ggcgttggak aaaagtttgg caatggatcg catggtttgg 
6181 ttcttttcct tgtccgcgcg ctctttggcg gcgatgttga gttggacata ctcgcgtgcc 
6241 aggcacttcc attcggggaa gatagttgtt aattcatctg gcacgattct- cacttgccac 
6301 cctcgattat gcaaggtaat taaatccaca ctggtggcca cctcgcctcg aaggggttca 
6361 ttggtccaac agagcctacc tcctttccta gaacagaaag ggggaagtgg gtctagcata 
6421 agttcatcgg gagggtctgc atccatggta aagattcccg gaagtaaatc cttatcaaaa 
6481 tagctgatgg gagtggggtc atctaaggcc atttgccatt ctcgagctgc cagtgcgcgc 
6541 tcatatgggt taaggggact gccccagggc atgggatggg tgagagcaga ggcatacatg 
6601 ccacagatgt catagacgta gatgggatcc tcaaagatgc ctatgtaggt tggatagcat 
6661 cgcccccctc tgatacttgc tcgcacatag tcatatagtt catgtgatgg cgctagcagc 
6721 cccggaccca agttggtgcg attgggtttt tctgttctgt agacgatctg gcgaaagatg 
6781 gcgl:gagaat tggaagagat ggtgggtctt tgaaaaatgt tgaaatgggc atgaggtaga 
6841 cctacagagt ctctgacaaa gtgggcataa gattcttgaa gcttggttac cagttcggcg 
6901 gtgacaagta cgtctagggc gcagtagtca agtgtttctt gaatgatgtc ataacctggt 
6961 tggtttttct tttcccacag ttcgcggttg agaaggtatt cttcgcgatc cttccagtac 
7021 tcttctagcg gaaacccgtc tttgtctgca cggtaagatc ctagcatgta gaactgatta 
7081 actgccttgt aagggcagca gcccttctct acgggtagag agtatgcttg agcagctttt 
7141 cgtagcgaag cgtgagtaag ggcaaaggtg tctctgacca tgactttgag aaattggtat 
7201 ttgaagtcca tgtcgtcaca ggctccctgt tcccagagtt ggaagtctac ccgtttcttg 
7261 taggcggggt tgggcaaagc gaaagtaaca tcattgaaga gaatcttacc ggctctgggc 
7321 ataaaattgc gagtgatgcg gaaaggctgt ggtacttccg ctcgattgtt gatcacctgg 
7381 gcagctagga cgatttcgtc gaaaccgttg atgttgtgtc ctacgatgta taattctatg 
7441 aaacgcggcg tgcctctgac gtgaggtagc ttactgagct catcaaaggt taggtctgtg 
7501 gggtcagata aggcgtagtg ttcgagagcc cattcgtgca ggtgaggatt tgcatgtagg 
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7561 aatgatgacc aaagatctac cgccagtgct gtttgtaact ggtcccgata ctgacgaaaa 
7621 tgccggccaa ttgccatttt ttctggagtg acacagtaga aggttctggg gtcttgttgc 
7681 catcgatccc acttgagttt aatggctaga tcgtgggcca tgttgacgag acgctcttct 
7741 cctgagagtt tcatgaccag catgaaagga actagttgtt tgccaaagga tcccatccag 
7801 gtgtaagttt ccacatcgta ggtcaggaag agtctttctg tgcgaggatg agagccgatc 
7861 gggaagaact ggatttcctg ccaccagttg gaggattggc tgttgatgtg atggaagtag 
7921 aagtttctgc ggcgcgccga gcattcgtgt ttgtgcttgt acagacggcc gcagtagtcg 
7981 cagcgttgca cgggttgtat ctcgtgaatg agctgtacct ggcttccctt gacgagaaat 
8041 ttcagtggga- agccgaggcc- tggcgattgt -atctcgtgct cttctatatt cgctgtatcg 
8101 gcctgttcat cttctgtttc gatggtggtc atgctgacga gcccccgcgg gaggcaagtc 
8161 cagacctcgg cgcgggaggg gcggagctga aggacgagag cgcgcaggct ggagctgtcc 
8221 agagtcctga gacgctgcgg actcaggtta gtaggtaggg acagaagatt aacttgcatg 
8281 atcttttcca gggcgtgcgg gaggttcaga tggtacttga tttccacagg ttcgtttgta 
8341 gagacgtcaa tggcttgcag ggttccgtgt cctttgggcg ccactaccgt acctttgttt 
8401 tttcttttga tcggtggtgg ctctcttgct tcttgcatgc tcagaagcgg tgacggggac 
8461 gcgcgccggg cggcagcggt tgttccggac ccgggggcat ggctggtagt ggcacgtcgg 
8521 cgccgcgcac gggcaggttc tggtattgcg ctctgagaag acttgcgtgc gccaccacgc 
8581 gtcgattgac gtcttgtatc tgacgtctct gggtgaaagc taccggcccc gtgagcttga 
8641 acctgaaaga gagttcaaca gaatcaattt cggtatcgtt aacggcagct tgtctcagta 
8701 tttcttgtac gtcaccagag ttgtcctggt aggcgatctc cgccatgaac tgctcgattt 
8761 cttcctcctg aagatctccg cgacccgctc tttcgacggt ggccgcgagg tcattggaga 
8821 tacggcccat gagttgggag aatgcattca tgcccgcctc gttccagacg cggctgtaaa 
8881 ccacggcccc ctcggagtct cttgcgcgca tcaccacctg agcgaggtta agctccacgt 
8941 gtctggtgaa gaccgcatag ttgcataggc gctgaaaaag gtagttgagt gtggtggcaa 
9001 tgtgttcggc gacgaagaaa tacatgatcc atcgtctcag cggcatttcg ctaacatcgc 
9061 ccagagcttc caagcgctcc atggcctcgt agaagtccac ggcaaaatta aaaaactggg 
9121 agtttcgcgc ggacacggtc aattcctcct cgagaagacg gatgagttcg gctatggtgg 
9181 cccgtacttc gcgttcgaag gctcccggga tctcttcttc ctcttctatc tcttcttcca 
9241 ctaacatctc ttcttcgtct tcaggcgggg gcggaggggg cacgcggcga cgtcgacggc 
9301 gcacgggcaa acggtcgatg aatcgttcaa tgacctctcc gcggcggcgg cgcatggttt 
9351 cagtgacggc gcggccgttc tcgcgcggtc gcagagtaaa aacaccgccg cgcatctcct 
9421 taaagtggtg actgggaggt tctccgtttg ggagggagag ggcgctgatt atacatttta 
9481 ttaattggcc cgtagggact gcgcgcagag atctgatcgt gtcaagatcc acgggatctg 
9541 aaaacctttc gacgaaagcg tctaaccagt cacagtcaca aggtaggctg agtacggctt 
9601 cttgtgggcg ggggtggtta tgtgttcggt ctgggtcttc tgtttcttct tcatctcggg 
9661 aaggtgagac gatgctgctg gtgatgaaat taaagtaggc agttctaaga cggcggatgg 
9721 tggcgaggag caccaggtct ttgggtccgg cttgctggat acgcaggcga ttggccattc 
9781 cccaagcatt atcctgacat ctagcaagat ctttgtagta gtcttgcatg agccgttcta 
9841 cgggcacttc ttcctcaccc gttctgccat gcatacgtgt gagtccaaat ccgcgcattg 
9901 gttgtaccag tgccaagtca gctacgactc tttcggcgag gatggcttgc tgtacttggg 
9961 taagggtggc ttgaaagtca tcaaaatcca caaagcggtg gtaagcccct gtattaatgg 
10021 tgtaagcaca gttggccatg actgaccagt taactgtctg gtgaccaggg cgcacgagct 
10081 cggtgtattt aaggcgcgaa taggcgcggg tgtcaaagat gtaatcgttg caggtgcgca 
10141 ccagatactg gtaccctata agaaaatgcg gcggtggttg gcggtagaga ggccatcgtt 
10201 ctgtagctgg agcgccaggg gcgaggtctt ccaacataag gcggtgatag ccgtagatgt 
10261 acctggacat ccaggtgatt cctgcggcgg tagtagaagc ccgaggaaac tcgcgtacgc 
10321 ggttccaaat gttgcgtagc ggcatgaagt agttcattgt aggcacggtt tgaccagtga 
10381 ggcgcgcgca gtcattgatg ctctatagac acggagaaaa tgaaagcgtt cagcgactcg 
10441 actccgtagc ctggaggaac gtgaacgggt tgggtcgcgg tgtaccccgg ttcgagactt 
10501 gtactcgagc cggccggagc cgcggctaac gtggtattgg cactcccgtc tcgacccagc 
10561 ctacaaaaat ccaggatacg gaatcgagtc gttttgctgg tttccgaatg gcagggaagt 
10621 gagtcctatt tttttttttt ttttgccgct cagatgcatc ccgtgctgcg acagatgcgc 
10681 ccccaacaac agcccccctc gcagcagcag cagcagcagc aaccacaaaa ggctgtccct 
10741 gcaactactg caactgccgc cgtgagcggt gcgggacagc ccgcctatga tctggacttg 
10801 gaagagggcg aaggactggc acgtctaggt gcgccttcgc ccgagcggca tccgcgagtt 
10861 caactgaaaa aagattctcg cgaggcgtat gtgccccaac agaacctatt tagagacaga 
10921 agcggcgagg agccggagga gatgcgagct tcccgcttta acgcgggtcg tgagctgcgt 
10981 cacggtttgg accgaagacg agtgttgcga gacgaggatt tcgaagttga tgaagtgaca 
11041 gggatcagtc ctgccagggc acacgtggct gcagccaacc ttgtatcggc ttacgagcag 
11101 acagtaaagg aagagcgtaa cttccaaaag tcttttaata atcatgtgcg aaccctgatt 
11161 gcccgcgaag aagttaccct tggtttgatg catttgtggg atttgatgga agctatcatt 
11221 cagaacccta ctagcaaacc tctgaccgcc cagctgtttc tggtggtgca acacagcaga 
11281 gacaatgagg ctttcagaga ggcgctgctg aacatcaccg aacccgaggg gagatggttg 
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11341 tatgatctta tcaacattct acagagtatc atagtgcagg agcggagcct gggcctggcc 
11401 gagaaggtag ctgccatcaa ttactcggtt ttgagcttgg gaaaatatta cgctcgcaaa 
11461 atctacaaga ctccatacgt tcccatagac aaggaggtga agatagatgg gttctacatg 
11521 cgcatgacgc tcaaggtctt gaccctgagc gatgatcttg gggtgtatcg caatgacaga 
11581 atgcatcgcg cggttagcgc cagcaggagg cgcgagttaa gcgacaggga actgatgcac 
11641 agtttgcaaa gagctctgac tggagctgga accgagggtg agaattactt cgacatggga 
11701 gctgacttgc agtggcagcc tagtcgcagg gctctgagcg ccgcgacggc aggatgtgag 
11761 cttccttaca tagaagaggc ggatgaaggc gaggaggaag agggcgagta cttggaagac 
11821 tgatggcaca acccgtgttt tttgctagat ggaacagcaa gcaccggatc ccgcaatgcg 
11881 ggcggcgctg cagagccagc cgtccggcat taactcctcg gacgattgga cccaggccat 
11941 gcaacgtatc atggcgttga cgactcgcaa ccccgaagcc tttagacagc aaccccaggc 
12001 caaccgtcta tcggccatca tggaagctgt agtgccttcc cgatctaatc ccactcatga 
12061 gaaggtcctg gccatcgtga acgcgttggt ggagaacaaa gctattcgtc cagatgaggc 
12121 cggactggta tacaacgctc tcttagaacg cgtggctcgc tacaacagta gcaatgtgca 
12181 aaccaatttg gaccgtatga taacagatgt acgcgaagcc gtgtctcagc gcgaaaggtt 
12241 ccagcgtgat gccaacctgg gttcgctggt ggcgttaaat gctttcttga gtactcagcc 
12301 tgctaatgtg ccgcgtggtc aacaggatta tactaacttt ttaagtgctt tgagactgat 
12361 ggtatcagaa gtacctcaga gcgsiagtgta tcagtccggt cctgattact tctttcagac 
12421 tagcagacag ggcttgcaga cggtaaatct gagccaagct tttaaaaacc ttaaaggttt 
12481 gtggggagtg catgccccgg taggagaaag agcaaccgtg tctagcttgt taactccgaa 
12541 ctcccgcctg ttattactgt tggtagctcc tttcaccgac agcggtagca tcgaccgtaa 
12601 ttcctatttg ggttacctac taaacctgta tcgcgaagcc atagggcetaa gtcaggtgga 
12661 cgagcagacc tatcaagaaa ttacccaagt cagtcgcgct ttgggacagg aagacactgg 
12721 cagtttggaa gccactctga acttcttgct taccaatcgg tctcaaaaga tccctcctca 
12781 atatgctctt actgcggagg aggagaggat ccttagatat gtgcagcaga gcgtgg^att 
12841 gtttctgatg caagaggggg caactccgac tgcagcactg gacatgacag cgcgaaatat 
12901 ggagcccagc atgtatgcca gtaaccgacc tttcattaac aaactgctgg actacttgca 
12961 cagagctgcc gctatgsiact ctgattattt caccaatgcc atcttaaacc cgcactggct 
13021 gcccccacct ggtttctaca cgggcgaata tgacatgccc gaccctaatg acggatttct 
13081 gtgggacgac gtggacagcg atgttttttc acctctttct gatcatcgca cgtggaaaaa 
13141 ggaaggcggt gatagaatgc attcttctgc atcgctgtcc ggggtcatgg gtgctaccgc 
13201 ggctgagccc gagtctgcaa gtccttttcc tagtctaccc ttttctctac acagtgtacg 
13261 tagcagcgaa gtgggtagaa taagtcgccc gagtttaatg ggcgaagagg agtacctaaa 
13321 cgattccttg ctcagaccgg caagagaaaa aaatttccca aacaatggaa tagaaagttt 
13381 ggtggataaa atgagtagat ggaagactta tgctcaggat cacagagacg agcctgggat 
13441 catggggact acaagtagag cgagccgtag acgccagcgc catgacagac agaggggtct 
13501 tgtgtgggac gatgaggatt cggccgatga tagcagcgtg ttggacttgg gtgggagagg 
13561 aaggggcaac ccgtttgctc atttgcgccc tcgcttgggt ggtatgttgt gaaaaaaaat 
13621 aaaaaagaaa aactcaccaa ggccatggcg acgagcgtac gttcgttctt ctttattatc 
13681 tgtgtctagt ataatgaggc gagtcgtgct aggcggagcg gtggtgtatc cggagggtcc 
13741 tcctccttcg tacgagagcg tgatgcagca gcagcaggcg acggcggtga tgcaatcccc 
13801 actggaggct ccctttgtgc ctccgcgata cctggcacct acggagggca gaaacagcat 
13861 tcgttactcg gaactggcac ctcagtacga taccaccagg ttgtatctgg tggacaacaa 
13921 gtcggcggac attgcttctc tgaactatca gaatgaccac agcaacttct tgaccacggt 
139B1 ggtgcagaac aatgacttta cccctacgga agccagcacc cagaccatta actttgatga 
14041 acgatcgcgg tggggcggtc agctaaagac catcatgcat actaacatgc caaacgtgaa 
14101 cgagtatatg tttagtaaca agttcaaagc gcgtgtgatg gtgtccagaa aacctcccga 
14161 cggtgctgca gttggggata cttatgatca caagcaggat attttggaat atgagtggtt 
14221 cgagtttact ttgccagaag gcaacttttc agttactatg actattgatt tgatgaacaa 
14281 tgccatcata gataattact tgaaagtggg tagacagaat ggagtgcttg aaagtgacat 
14341 tggtgttaag ttcgacacca ggaacttcaa gctgggatgg gatcccgaaa ccaagttgat 
14401 catgcctgga gtgtatacgt atgaagcctt ccatcctgac attgtcttac tgcctggctg 
14461 cggagtggat tttaccgaga gtcgtttgag caaccttctt ggtatcagaa aaaaacagcc 
14521 atttcaagag ggttttaaga ttttgtatga agatttagaa ggtggtaata ttccggccct 
14581 cttggatgta gatgcctatg agaacagtaa gaaagaacaa aaagccaaaa tagaagctgc 
14641 tacagctgct gcagaagcta aggcaaacat agttgccagc gactctacaa gggttgctaa 
14701 cgctggagag gtcagaggag acaattttgc gccaacacct gttccgactg cagaatcatt 
14761 attggccgat gtgtctgatg gaacggacgt gaaactcact attcaacctg tagaaaaaga 
14821 tagtaagaat agaagctata atgtgttgga agacaaaatc aacacagcct atcgcagttg 
14881 gtatctttcg tacaattatg gcgatcccga aaaaggagtg cgttcctgga cattgctcac 
14941 cacctcagat gtcacctgcg gagcagagca ggtttactgg tcgcttccag acatgatgaa 
15001 ggatcctgtc actttccgct ccactagaca agtcagtaac taccctgtgg tgggtgcaga 
15061 gcttatgccc gtcttctcaa agagcttcta caacgaacaa gctgtgtact cccagcagct 
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15121 ccgccagtcc acctcgctta cgcacgtctt caaccgcttt cctgagaacc agattttaat 
15181 ccgtccgccg gcgcccacca ttaccaccgt cagtgaaaac gttcctgctc tcacagatca 
15241 cgggaccctg ccgttgcgca gcagtatccg gggagtccaa cgtgtgaccg ttactgacgc 
15301 cagacgccgc acctgtccct acgtgtacaa ggcactgggc atagtcgcac cgcgcgtcct 
15361 ttcaagccgc actttctaaa aaaaaaatgt ccattcttat ctcgcccagt aataacaccg 
15421 gttggggtct gcgcgctcca agcaagatgt acggaggcgc acgcaaacgt tctacccaac 
15481 atcccgtgcg tgttcgcgga cattttcgcg ctccatgggg tgccctcaag ggccgcactc 
15541 gcgttcgaac caccgtcgat gatgtaatcg atcaggtggt tgccgacgcc cgtaattata 
15601 ctcctactgc gcctacatct-actgtggatg -cagttattga.cagi:g.tagtg_gctgacgctc. . . 
15661 gcaactatgc tcgacgtaag agccggcgaa ggcgcattgc cagacgccac cgagctacca 
15721 ctgccatgcg agccgcaaga gctctgctac gaagagctag acgcgtgggg cgaagagcca 
15781 tgcttagggc ggccagacgt gcagcttcgg gcgccagcgc cggcaggtcc cgcaggcaag 
15841 cagccgctgt cgcagcggcg actattgccg acatggccca atcgcgaaga ggcaatgtat 
15901 actgggtgcg tgacgctgcc accggtcaac gtgtacccgt gcgcacccgt ccccctcgca 
15961 cttagaagat actgagcagt ctccgatgtt gtgtcccagc ggcgaggatg tccaagcgca 
16021 aatacaagga agaaatgctg caggttatcg cacctgaagt ctacggccaa ccgttgaagg 
16081 atgaaaaaaa accccgcaaa atcaagcggg ttaaaaagga caaaaaagaa gaggaagatg 
16141 gcgatgatgg gctggcggag tttgtgcgcg agtttgcccc acggcgacgc gtgcaatggc 
16201 gtgggcgcaa agttcgacat gtgttgagac ctggaacttc ggtggtcttt acacccggcg 
16261 agcgttcaag cgctactttt aagcgttcct atgatgaggt gtacggggat gatgatattc 
16321 ttgagcaggc ggctgaccga ttaggcgagt ttgcttatgg caagcgtagt agaataactt 
16381 ccaaggatga gacagtgtca atacccttgg atcatggaaa tcccacccct agtcttaaac 
16441 cggtcacttt gcagcaagtg ttacccgtaa ctccgcgaac aggtgttaaa cgcgaaggtg 
16501 aagatttgta tcccactatg caactgatgg tacccaaacg ccagaagttg gaggacgttt 
16561 tggagaaagt aaaagtggat ccagatattc aacctgaggt taaagtgaga cccattaagc 
16621 aggtagcgcc tggtctgggg gtacaaactg tagacattaa gattcccact gaaagt^tgg • 
16681 aagtgcaaac tgaacccgca aagcctactg ccacctccac tgaagtgcaa acggatccat 
16741 ggatgcccat gcctattaca actgacgccg ccggtcccac tcgaagatcc cgacgaaagt 
16B01 acggtccagc aagtctgttg atgcccaatt atgttgtaca cccatctatt attcctactc 
16861 ctggttaccg aggcactcgc tactatcgca gccgaaacag tacctcccgc cgtcgccgca . 
16921 agacacctgc aaatcgcagt cgtcgccgta gacgcacaag caaaccgact cccggcgccc 
16981 tggtgcggca agtgtaccgc aatggtagtg cggaaccttt gacactgccg cgtgcgcgtt 
17041 accatccgag tatcatcact taatcaatgt tgccgctgcc tccttgcaga tatggccctc 
17101 acttgtcgcc ttcgcgttcc catcactggt taccgaggaa gaaactcgcg ccgtagaaga 
17161 gggatgttgg gacgcggaat gcgacgctac aggcgacggc gtgctatccg caagcaattg 
17221 cggggtggtt ttttaccagc cttaattcca attatcgctg ctgcaattgg cgcgatacca 
17281 ggcatagctt ccgtggcggt tcaggcctcg caacgacatt gacattggaa aaaaaacgta 
17341 taaataaaaa aaaatacaat ggactctgac actcctggtc ctgtgactat gttttcttag 
17401 agatggaaga catcaatttt tcatccttgg ctccgcgaca cggcacgaag ccgtacatgg 
17461 gcacctggag cgacatcggc acgagccaac tgaacggggg cgccttcaat tggagcagta 
17521 tctggagcgg gcttaaaaat tttggctcaa ccataaaaac atacgggaac aaagcttgga 
17581 acagcagtac aggacaggcg cttagaaata aacttaaaga ccagaacttc caacaaaaag 
17641 tagtcgatgg gatagcttcc ggcatcaatg gagtggtaga tttggctaac caggctgtgc 
17701 agaaaaagat aaacagtcgt ttggacccgc cgccagcaac cccaggtgaa atgcaagtgg 
17761 aggaagaaat tcctccgcca gaaaaacgag gcgacaagcg tccgcgtccc gatttggaag 
17821 agacgctggt gacgcgcgta gatgaaccgc cttcttatga ggaagcaacg aagcttggaa 
17881 tgcccaccac tagaccgata gccccaatgg ccaccggggt gatgaaacct tctcagttgc 
17941 atcgacccgt caccttggat ttgccccctc cccctgctgc tactgctgta cccgcttcta 
18001 agcctgtcgc tgccccgaaa ccagtcgccg tagccaggtc acgtcccggg ggcgctcctc 
18061 gtccaaatgc gcactggcaa aatactctga acagcatcgt gggtctaggc gtgcaaagtg 
18121 taaaacgccg tcgctgcttt taattaaata tggagtagcg cttaacttgc ctatctgtgt 
18181 atatgtgtca ttacacgccg tcacagcagc agaggaaaaa aggaagaggt cgtgcgtcga 
18241 cgctgagtta ctttcaagat ggccacccca tcgatgctgc cccaatgggc atacatgcac 
18301 atcgccggac aggatgcttc ggagtacctg agtccgggtc tggtgcagtt cgcccgcgcc 
18361 acagacacct acttcaatct gggaaataag tttagaaatc ccaccgtagc gccgacccac 
18421 gatgtgacca ccgaccgtag ccagcggctc atgttgcgct tcgtgcccgt tgaccgggag 
18481 gacaatacat actcttacaa agtgcggtac accctggccg tgggcgacaa cagagtgctg 
18541 gatatggcca gcacgttctt tgacattagg ggcgtgttgg acagaggtcc cagtttcaaa 
18601 ccctattctg gtacggctta caactctctg gctcctaaag gcgctccaaa tgcatctcaa 
18661 tggattgcaa aaggcgtacc aactgcagca gccgcaggca atggtgaaga agaacatgaa 
18721 acagaggaga aaactgctac ttacactttt gccaatgctc ctgtaaaagc cgaggctcaa 
18781 attacaaaag agggcttacc aataggtttg gagatttcag ctgaaaacga atctaaaccc 
18841 atctatgcag ataaacttta tcagccagaa cctcEiagtgg gagatgaaac ttggactgac 
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18901 ctagacggaa aaaccgaaga gtatggaggc agggctctaa agcctactac taacatgaaa 
18961 ccctgttacg ggtcctatgc gaagcctact aatttaaaag gtggtcaggc aaaaccgaaa 
19021 aactcggaac cgtcgagtga aaaaattgaa tatgatattg acatggaatt ttttgataac 
19081 tcatcgcaaa gaacaaactt cagtcctaaa attgtcatgt atgcagaaaa tgtaggtttg 
19141 gaaacgccag acactcatgt agtgtacaaa cctggaacag aagacacaag ttccgaagct 
19201 aatttgggac aacagtctat gcccaacaga cccaactaca ttggcttcag agataacttt 
19261 attggactca tgtactataa cagtactggt aacatggggg tgctggctgg tcaagcgtct 
19321 cagttaaatg cagtggttga cttgcaggac agaaacacag aactttctta ccaactcttg 
19381 cttgactctc tgggcgacag aaccagatac tttagcatgt ggaatcaggc tgtggacagt 
19441 tatgatcctg atgtacgtgt tattgaaaat catggtgtgg aagatgaact tcccaactat 
19501 tgttttccac tggacggcat aggtgttcca acaaccagtt acaaatcaat agttccaaat 
19561 ggagaagata ataataattg gaaagaacct gaagtaaatg gaacaagtga gatcggacag 
19621 ggtaatttgt ttgccatgga aattaacctt caagccaatc tatggcgaag tttcctttat 
19681 tccaatgtgg ctctgtatct cccagactcg tacaaataca ccccgtccaa tgtcactctt 
19741 ccagaaaaca aaaacaccta cgactacatg aacgggcggg tggtgccgcc atctctagta 
19801 gacacctatg tgaacattgg tgccaggtgg tctctggatg ccatggacaa tgtcaaccca 
19861 ttcaaccacc accgtaacgc tggcttgcgt taccgatcta tgcttctggg taacggacgt 
19921 tatgtgcctt tccacataca agtgcctcaa aaattcttcg ctgttaaaaa cctgctgctt 
19981 ctcccaggct cctacactta tgagtggaac tttaggaagg atgtgaacat ggttctacag 
20041 agttccctcg gtaacgacct gcgggtagat ggcgccagca tcagtttcac gagcatcaac 
20101 ctctatgcta cttttttccc catggctcac aacaccgctt ccacccttga agccatgctg 
20161 cggaatgaca ccaatgatca gtcattcaac gactacctat ctgcagctaa catgctctac 
20221 cccattcctg ccaatgcaac caatattccc atttccattc cttctcgcaa ctgggcggct 
20281 ttcagaggct ggtcatttac cagactgaaa accaaagaaa ctccctcttt ggggtctgga 
20341 tttgacccct actttgtcta ttctggttct attccctacc tggatggtac cttctacctg 
204O1 aaccacactt ttaagaaggt ttccatcatg tttgactctt cagtgagctg gcctggaaat 
20461 gacaggttac tatctcctaa cgaatttgaa ataaagcgca ctgtggatgg cgaaggctac 
20521 aacgtagccc aatgcaacat gaccaaagac tggttcttgg tacagatgct cgccaactac 
20581 aacatcggct atcagggctt ctacattcca gaaggataca aagatcgcat gtattcattt 
20641 ttcagaaact tccagcccat gagcaggcag gtggttgatg aggtcaatta caaagacttc 
20701 aaggccgtcg ccatacccta ccaacacaac aactctggct ttgtgggtta catggctccg 
20761 accatgcgcc aaggtcaacc ctatcccgct aactatccct atccactcat tggaacaact 
20821 gccgtaaata gtgttacgca gaaaaagttc ttgtgtgaca gaaccatgtg gcgcataccg 
20881 ttctcgagca acttcatgtc tatgggggcc cttacagact tgggacagaa tatgctctat 
20941 gccaactcag ctcatgctct ggacatgacc tttgaggtgg atcccatgga tgagcccacc 
21001 ctgctttatc ttctcttcga agttttcgac gtggtcagag tgcatcagcc acaccgcggc 
21061 atcatcgagg cagtctacct gcgtacaccg ttctcggccg gtaacgctac cacgtaagaa 
21121 gcttcttgct tcttgcaaat agcagctgca accatggcct gcggatccca aaacggctcc 
21181 agcgagcaag agctcagagc cattgtccaa gacctgggtt gcggacccta ttttttggga 
21241 acctacgata agcgcttccc ggggttcatg gcccccgata agctcgcctg tgccattgta 
21301 aatacggccg gacgtgagac ggggggagag cactggttgg ctttcggttg gaacccacgt 
213 61 tctaacacct gctacctttt tgatcctttt ggattctcgg atgatcgtct caaacagatt 
21421 taccagtttg aatatgaggg tctcctgcgc cgcagcgctc ttgctaccaa ggaccgctgt 
21481 attacgctgg aaaaatctac ccagaccgtg cagggccccc gttctgccgc ctgcggactt 
21541 ttctgctgca tgttccttca cgcctttgtg cactggcctg accgtcccat ggacggaaac 
21601 cccaccatga aattgctaac tggagtgcca aacaacatgc ttcattctcc taaagtccag 
21661 cccaccctgt gtgacaatca aaaagcactc taccattttc ttaataccca ttcgccttat 
21721 tttcgctctc atcgtacaca catcgaaagg gccactgcgt tcgaccgtat ggatgttcaa 
21781 taatgactca tgtaaacaac gtgttcaata aacatcactt tattttttta catgtatcaa 
21841 ggctctggat tacttattta tttacaagtc gaatgggttc tgacgagaat cagaatgacc 
21901 cgcaggcagt gatacgttgc ggaactgata cttgggttgc cacttgaatt cgggaatcac 
21961 caacttggga accggtatat. cgggcaggat gtcactccac agctttctgg tcagctgcaa 
22.021 agctccaagc aggtcaggag ccgaaatctt gaaatcacaa ttaggaccag tgctctgagc 
22081 gcgagagttg cggtacaccg gattgcagca ctgaaacacc atcagcgacg gatgtctcac 
22141 gcttgccagc acggtgggat ctgcaatcat gcccacatcc agatcttcag cattggcaat 
22201 gctgaacggg gtcatcttgc aggtctgcct acccatggcg ggcacccaat taggcttgtg 
22261 gttgcaatcg cagtgcaggg ggatcagtat catcttggcc tgatcctgtc tgattcctgg 
22321 atacacggct ctcatgaaag catcatattg cttgaaagcc tgctgggctt tactaccctc 
22381 ggtataaaac atcccgcagg acctgctcga aaactggtta gctgcacagc cggcatcatt 
. 22441 cacacagcag cgggcgtcat tgttggctat ttgcaccaca cttctgcccc agcggttttg 
22501 ggtgattttg gttcgctcgg gattctcctt taaggctcgt tgtccgttct cgctggccac 
22561 atccatctcg ataatctgct ccttctgaat cataatattg ccatgcaggc acttcagctt 
22621 gccctcataa tcattgcagc catgaggcca caacgcacag cctgtacatt cccaattatg 
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22681 gtgggcgatc tgagaaaaag aatgtatcat tccctgcaga aatcttccca tcatcgtgct 
22741 cagtgtcttg tgactagtga aagttaactg gatgcctcgg tgctcttcgt ttacgtactg 
22801 gtgacagatg cgcttgtatt gttcgtgttg ctcaggcatt agtttaaaac aggttctaag 
22861 ttcgttatcc agcctgtact tctccatcag cagacacatc acttccatgc ctttctccca 
22921 agcagacacc aggggcaagc taatcggatt cttaacagtg caggcagcag ctcctttagc 
22981 cagagggtca tctttagcga tcttctcaat gcttcttttg ccatccttct caacgatgcg 
23041 cacgggcggg tagctgaaac ccactgctac aagttgcgcc tcttctcttt cttcttcgct 
23101 gtcttgactg atgtcttgca tggggatatg tttggtcttc cttggcttct ttttgggggg 
23161 tatcggagga ggaggactgt cgctccgttc cggagacagg gaggattgtg acgtttcgct. 
23221 caccattacc aactgactgt cggtagaaga icctgacccc acacggcgac aggtgttttt 
23281 cttcgggggc agaggtggag gcgattgcga agggctgcgg tccgacctgg aaggcggatg 
23341 actggcagaa ccccttccgc gttcgggggt gtgctccctg tggcggtcgc ttaactgatt 
23401 tccttcgcgg ctggccattg tgttctccta ggcagagaaa caacagacat ggaaactcag 
23461 ccattgctgt caacatcgcc acgagtgcca tcacatctcg tcctcagcga cgaggaaaag 
23521 gagcagagct taagcattcc accgcccagt cctgccacca cctctaccct agaagataag 
23581 gaggtcgacg catctcatga catgcagaat aaaaaagcga aagagtctga gacagacatc 
23641 gagcaagacc cgggctatgt gacaccggtg gaacacgagg aagagttgaa acgctttcta 
23701 gagagagagg atgaaaactg cccaaaacag cgagcagata actatcacca agatgctgga 
23761 aatagggatc agaacaccga ctacctcata gggcttgacg gggaagacgc gctccttaaa 
23821 catctagcaa gacagtcgct catagtcaag gatgcattat tggacagaac tgaagtgccc 
23881 atcagtgtgg aagagctcag ctgcgcctac gagcttaacc ttttttcacc tcgtactccc 
23941 cccaaacgtc agccaaacgg cacctgcgag ccaaatcctc gcttaaactt ttatccagct 
24001 tttgctgtgc cagaagtact ggctacctat cacatctttt ttaaaaatca aaaaattcca 
24061 gtctcctgcc gcgctaatcg cacccgcgcc gatgccctac tcaatctggg acctggttca 
24121 cgcttacctg atatagcttc cttggaagag gttccaaaga tcttcgaggg tctgggcaat 
24181 aatgagactc gggccgcaaa tgctctgcaa aagggagaaa atggcatgga tgagcatcac 
24241 agcgttctgg tggaattgga aggcgataat gccagactcg cagtactcaa gcgaagcgtc 
24301 gaggtcacac acttcgcata tcccgctgtc aacctgcccc ctaaagtcat gacggcggtc 
24361 atggaccagt tactcattaa gcgcgcaagt cccctttcag aagacatgca tgacccagat 
24421 gcctgtgatg agggtaaacc agtggtcagt gatgagcagc taacccgatg gctgggcacc 
24481 gactctcccc gggatttgga agagcgtcgc aagcttatga tggccgtggt gctggttacc 
24541 gtagaactag agtgtctccg acgtttcttt accgattcag aaaccttgcg caaactcgaa 
24601 gagaatctgc actacacttt tagacacggc tttgtgcggc aggcatgcaa gatatctaac 
24661 gtggaactca ccaacctggt ttcctacatg ggtattctgc atgagaatcg cctaggacaa 
24721 agcgtgctgc acagcaccct taagggggaa gcccgccgtg attacatccg cgattgtgtc 
24781 tatctctacc tgtgccacac gtggcaaacc ggcatgggtg tatggcagca atgtttagaa 
24841 gaacagaact tgaaagagct tgacaagctc ttacagaaat ctcttaaggt tctgtggaca 
24901 gggttcgacg agcgcaccgt cgcttccgac ctggcagacc tcatcttccc agagcgtctc 
24961 agggttactt tgcgaaacgg attgcctgac tttatgagcc agagcatgct taacaatttt 
25021 cgctctttca tcctggaacg ctccggtatc ctgcccgcca cctgctgcgc actgccctcc 
25081 gactttgtgc ctctcaccta ccgcgagtgc cccccgccgc tatggagtca ctgctacctg 
25141 ttccgtctgg ccaactatct ctcctaccac tcggatgtga tcgaggatgt gagcggagac 
25201 ggcttgctgg agtgccactg ccgctgcaat ctgtgcacgc cccaccggtc cctagcttgc 
25261 aacccccagt tgatgagcga aacccagata ataggcacct ttgaattgca aggccccagc 
25321 agccaaggcg atgggtcttc tcctgggcaa agtttaaaac tgaccccggg actgtggacc 
25381 tccgcctact tgcgcaagtt tgctccggaa gattaccacc cctatgaaat caagttctat 
25441 gaggaccaat cacagcctcc aaaggccgaa ctttcggctt gcgtcatcac ccagggggca 
25501 attctggccc aattgcaagc catccaaaaa tcccgccaag aatttctact gaaaaagggt 
25561 aagggggtct accttgaccc ccagaccggc gaggaactca acacaaggtt ccctcaggat 
25621 gtcccaacga cgagaaaaca agaagttgaa ggtgcagccg ccgcccccag aagatatgga 
25681 ggaagattgg gacagtcagg cagaggaggc ggaggaggac agtctggagg acagtctgga 
25741 ggaagacagt ttggaggagg aaaacgagga ggcagaggag gtggaagaag taaccgccga 
25801 caaacagtta tcctcggctg cggagacaag caacagcgct accatctccg ctccgagtcg 
25861 aggaacccgg cggcgtccca gcagtagatg ggacgagacc ggacgcttcc cgaacccaac 
25921 cagcgcttcc aagaccggta agaaggatcg gcagggatac aagtcctggc gggggc^taa 
25981 gaatgccatc atctcctgct tgcatgagtg cgggggcaac atatccttca cgcggcgcta 
26041 cttgctattc caccatgggg tgaactttcc gcgcaatgtt ttgcattact accgtcacct 
26101 ccacagcccc tactatagcc agcaaatccc gacagtctcg acagataaag acagcggcgg 
26161 cgacctccaa cagaaaacca gcagcggcag ttagaaaata .cacaacaagt gcagcaacag 
26221 gaggattaaa gattacagcc aacgagccag cgcaaacccg agagttaaga aatcggatct 
26281 ttccaaccct gtatgccatc ttccagcaga gtcggggtca agagcaggaa ctgaaaataa 
26341 aaaaccgatc tctgcgttcg ctcaccagaa gttgtttgta tcacaagagc gaagatcaac 
26401 ttcagcgcac tctcgaggac gccgaggctc tcttcaacaa gtactgcgcg ctgactctta 
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55461 aaaaataaoc agcgaccgcg cttattcaaa aaaggcggga attacatcat cctcgapatg 
agtalagala ?tcccacgcc ttacatgtgg agttatcaac cccaaatggg fttggcagca 
2S581 aacacctccc aggactactc cacccgcatg aattggctca gcgccgggcc ttctatgatt 
mil tctSagttI ttgatatacg cgcctaccga aaccaaatac ttttggaaca ^tcagctctt 
2G701 accaccacgc cccgccaaca ccttaatocc agaaattggc ccgccgccct agtgtaccag 
ll^ll allScccg ctcccaccac tgtattactt cctcgagacg cccaggccga agtccaaatg 
ofiB9i frtaftacaa atacgcagtt agctggcggc tccaccctat gtcgtcacag gcctcggcat 
IgSII Sttelltl ?clSa?gat caga||cc|a ggtatccagc tcaacgacga gtcggtgagc 
26141 tcKScttg rtltlcglcc agacggaatc tttcagattg ccggctgcgg gagatcttcc 
llloi ttcacfcctf Scaggctgt tctgactttg gaaagttcgt cttcgcaacc ccgctcgggc 
lllli aaaatcggga ccgttlaatt tgtagaggag tttactccct ctgtctactt caaccccttc 
27121 ?ccgS?itc c?gggcacta cccggacgag ttcataccga acttcgacgc gattagcgag 
27181 Sagtgglcg gctlcgattg atgtctggtg acgcggctga gctatctcgg ctgcgacatc 
27241 taSccIctg Icgcclcttl cgctgctttg cocgggaact tattgagttc atctacttcg 
27301 al^tcctcal ggltclccct caaggtccgg cccacggagt gcggattact atcgaaggca 
27361 aaataaactc tcgcctgcaa cgaattttct cccagcggcc cgtgctgatc gagcgagacc 
2742^ aaaSalacal cacStttcc atetactgca tttgtaatca ccccggattg catgaaagcc 
Hill tttlltstct ttStgtact gagtttaita aaaactgaat taagactctc ctacggactg 
27541 cSct^Stc atclclgatt tticaaccag aagaacaaaa cttttcctgt cgtccaggac 
27601 tctgtSact toacctttcc tactcacaaa ctagaagctc aacgactaca ccgcttttcc 
27661 a^Iacattt tccctactaa tactactttc aaaaccggag gtgagctcca cggtctccct 
2?"i aSllaacc cttgggtgga agcgggcctt gtagtactag gaattcttgc ggSftgggott 
^7781 ataattattc tttgctacct atacacacct tgcttcactt tcctagtggt gttgtggtat 
Villi ?aattttala aatggggccc atactagtct tgcttgtttt actttcgctt ttggaaccgg 
llltl SStgccal ttacgltcca tgtctagact ttgacccaga aaactgcaca cttacttttg 
27961 cacccgacac aagccgcatc tgtggagttc ttattaagtg cggatgggaa tgcaggtccg 
28021 ttaaalttac acacaitaac aaaacctgga acaatacctt atccaccaca tgggagccag 
28081 gSt^lccga Jtggtacact gtctctgtcc gaggtcctga cggttccatc cgcattagta 
28141 Icaacacttt lattttttct gaaatgtgcg atctggccat gttcatgagc ^aacagtatt 
28201 ctctatggcc tcctagcaag gacaacatcg taacgttctc cattgcttat tgcttgtgcg 
28261 ittScttlt tactgcttta ctgtgcgtat gcatacacct gcttgtaacc actcgcatca 
mil l^cgccL taaclaagaa aaaatgcctt aacctctttc tgtttacaga catggcttct 
28381 cttacatctc tcatatttgt cagcattgtc actgccgctc acggacaaac agtcgtctct 
lllll atcccaclag gacataatla cactctcata ggacccccaa tcacttcaga ggtcatctgg 
28501 aclalactgg gaagcgttga ttactttgat ataatctgta acaaaacaaa accaataata 
IIIgi at^cttgca Icatacaaaa tcttacattg attaatgtta gcaaagttta cagcggttac 
2lMi tawltggtt atgacagata cagtagtcaa tatagaaatt acttggttcg tgttacccag 
2868i ??gaaalcca cglaaatgcc aaatatggca aagattcgat ccgatgacaa ttctctagaa 
2875i acttttacat ctcccaccac acccgacgaa aaaaacatcc cagattcaat g^tgcaatt 
•jftani nttacaacaa taacagtggt gatggcacta ataataatat gcatgctttt atatgcccgt 
mil cacfacim agtttlatic taailaacaa gatctcctac taaggcttaa catttaattt 
28921 cttttttS^ agccatggtt tccactacca cattccttat gcttactagt ctcgcaactc 
28981 toacttctgc tcgctcacac ctcactgtaa ctataggctc aaactgcaca ctaaaaggac 
29041 ctlSgSgg tcltgtcttt tggtggagaa tatatgacaa tggatggttt acaaaaccat 
29101 5S"cSl5 tggtagattt ttctgcaacg gcagagacct aaccattatc aacgtgacag 
29161 calatgacaa alicttctat tatggaaccg actataaaag tagtttagat tataacatta 
29221 ttgtactgcc atctaccact ccagcacccc gcacaactac tttctctagc agcagtgtcg 
29281 ctaacaatac aatttccaat ccaacctttg ccgcgctttt aaaacgcact gtgaataatt 
29341 ctacaacttc acatacaaca atttccactt caacaatcag catcatcgct gcagtgacaa 
29401 ttggaatatc tattcttgtt tttaccataa cctactacgc ctgctgctat agaaaagaca 
29461 aalSaaagg tgatccatta cttagatttg atatttaatt tgttcttttt ttttatttac 
29521 StatggtS alaccaatca tggtacctag aaatttcttc ttcaccatac tcatctgtgc 
29581 ttttaatgtt tgcgctactt tcacagcagt agccacagca accccagact gtataggagc 
29641 atttgcttcc tatgcacttt ttgcttttgt tacttgcatc tgcgtatgta gcatagtctg 
29701 cctggttatt aattttttcc aacttctaga ctggatcctt gtgcgaattg cctacctgcg 
29761 ccaccatccc gaataccgca accaaaatat cgcggcactt cttagactca tctaaaacca 
29821 tocaggctat actaccaata tttttgcttc tattgcttcc ctacgctgtc tcaaccccag 
29881 ctGCCtatag tactccacca gaacacctta gaaaatgcaa attccaacaa ccgtggtcat 
29941 ttcttgcttg ctatcgagaa aaatcagaaa tccccccaaa tttaataatg attgctggaa 
30001 taattaatat aatctgttgc accataattt catttttgat atacccccta tttgattttg 
30061 crctggaatgc tcccaatgca catgatcatc cacaagaccc agaggaacac attcccccac 
30121 aaaacatgca acatccaata gcgctaatag attacgaaag tgaaccacaa cccccactac 
30181 tccctgctat tagttacttc aacctaaccg gcggagatga ctgaaacact caccacctcc 
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30241 aattccgccg aggatctgct cgatatggac ggccgcgtct cagaacaacg acttgcccaa 
30301 ctacgcatcc gccagcagca ggaacgcgtg gccaaagagc tcagagatgt catccaaatt 
30361 caccaatgca aaaaaggcat attctgtttg gtaaaacaag ccaagatatc ctacgagatc 
30421 accgctactg accatcgcct ctcttacgaa cttggccccc aacgacaaaa atttacctgc 
30481 atggtgggaa tcaaccccat agttatcacc caacaaagtg gagatactaa gggttgcatt 
30541 cactgctcct gcgattccat cgagtgcacc tacaccctgc tgaagaccct atgcggccta 
30601 agagacctgc taccaatgaa ttaaaaaaaa atgattaata aaaaatcact tacttgaaat 
30661 cagcaataag gtctctgttg aaattttctc ccagcagcac ctcacttccc tcttcccaac 
30721 tctggtattc taaaccccgt tcagcggcat actttctcca tactttaaag gggatgtcaa 
30781 attttagctc ctctcctgta cccacaatct tcatgtcttt cttcccagat gaccaagaga 
30841 gtccggctca gtgactcctt caaccctgtc tacccctatg aagatgaaag cacctcccaa 
3O901 caccccttta taaacccagg gtttatttcc ccaaatggct tcacacaaag cccagacgga 
30961 gttcttactt taaaatgttt aaccccacta acaaccacag gcggatctct acagctaaaa 
31021 gtgggagggg gacttacagt ggatgacact gatggtacct tacaagaaaa catacgtgct 
31081 acagcaccca ttactaaaaa taatcactct gtagaactat ccattggaaa tggattagaa 
31141 actcaaaaca ataaactatg tgccaaattg ggaaatgggt taaaatttaa caacggtgac 
31201 atttgtataa aggatagtat taacacctta tggactggaa taaaccctcc acctaactgt 
31261 caaattgtgg aaaacactaa tacaaatgat ggcaaactta ctttagtatt agtaaaaaat 
31321 ggagggcttg ttaatggcta cgtgtctcta gttggtgtat cagacactgt gaaccaaatg 
31381 ttcacacaaa agacagcaaa catccaatta agattatatt ttgactcttc tggaaatcta 
31441 ttaactgagg aatcagactt aaaaattcca cttaaaaata aatcttctac agcgaccagt 
31501 gaaactgtag ccagcagcaa agcctfctatg ccaagtacta cagcttatcc cttcaacacc 
31561 actactaggg atagtgaaaa ctacattcat ggaatatgtt actacatgac tagttatgat 
31621 agaagtctat .ttcccttgaa catttctata atgctaaaca gccgtatgat ttcttccaat 
31681 gttgcctatg ccatacaatt tgaatggaat ctaaatgcaa gtgaatctcc agaaagcaac 
31741 atagctacgc tgaccacatc cccctttttc ttttcttaca ttacagaaga cgacaactaa 
31801 aataaagttt aagtgttttt atttaaaatc acaaaattcg agtagttatt ttgcctccac 
31861 cttcccattt gacagaatac accaatctct ccccacgcac agctttaaac atttggatac 
31921 cattagagat agacattgtt ttagattcca cattccaaac agtttcagag cgagccaatc 
31981 tggggtcagt gatagataaa aatccatcgc gatagtcttt taaagcgctt tcacagtcca 
32041 actgctgcgg atgcgactcc ggagtttgga tcacggtcat ctggaagaag aacgatggga 
32101 atcataatcc gaaaacggta tcggacgatt gtgtctcatc aaacccacaa gcagccgctg 
32161 tctgcgtcgc tccgtgcgac tgctgtttat gggatcaggg tccacagttt cctgaagcat 
32221 gattttaata gcccttaaca tcaactttct ggtgcgatgc gcgcagcaac gcattctgat 
32281 ttcactcaaa tctttgcagt aggtacaaca cattattaca atattgttta ataaaccata 
32341 attaaaagcg ctccagccaa aactcatatc tgatataatc gcccctgcat gaccatcata 
32401 ccaaagttta atataaatta aatgacgttc cctcaaaaac acactaccca catacatgat 
32461 ctcttttggc atgtgcatat taacaatctg tctgtaccat ggacaacgtt ggttaatcat 
32521 gcaacccaat ataaccttcc ggaaccacac tgccaacacc gctcccccag ccatgcattg 
32581 aagtgaaccc tgctgattac aatgacaatg aagaacccaa ttctctcgac cgtgaatcac 
32641 ttgagaatga aaaatatcta tagtggcaca acatagacat aaatgcatgc atcttctcat 
32701 aatttttaac tcctcaggat ttagaaacat atcccaggga ataggaagct cttgcagaac 
32761 agtaaagctg gcagaacaag gaagaccacg aacacaactt acactatgca tagtcatagt 
32821 atcacaatct ggcaacagcg ggtggtcttc agtcatagaa gctcgggttt cattttcctc 
32881 acaacgtggt aactgggctc tggtgtaagg gtgatgtctg gcgcatgatg tcgagcgtgc 
32941 gcgcaacctt gtcataatgg agttgcttcc tgacattctc gtattttgta tagcaaaacg 
33001 cggccctggc agaacacact cttcttcgcc ttctatcctg ccgcttagcg tgttccgtgt 
33061 gatagttcaa gtacagccac actcttaagt tggtcaaaag aatgctggct tcagttgtaa 
33121 tcaaaactcc atcgcatcta attgttctga ggaaatcatc cacggtagca tatgcaaatc 
33181 ccaaccaagc aatgcaactg gattgcgttt caagcaggag aggagaggga agagacggaa 
33241 gaaccatgtt aatttttatt ccaaacgatc tcgcagtact tcaaattgta gatcgcgcag 
33301 atggcatctc tcgcccccac tgtgttggtg aaaaagcaca gctaaatcaa aagaaatgcg 
33361 attttcaagg tgctcaacgg tggcttccaa caaagcctcc acgcgcacat ccaagaacaa 
33421 aagaatacca aaagaaggag cattttctaa ctcctcaatc atcatattac attcctgcac 
33481 cattcccaga taattttcag ctttccagcc ttgaattatt cgtgtcagtt cttgtggtaa 
33541 atccaatcca cacattacaa acaggtcccg gagggcgccc tccaccacca ttcttaaaca 
33601 caccctcata atgacaaaat atcttgctcc tgtgtcacct gtagcgaatt gagaatggca 
33661 acatcaattg acatgccctt ggctctaagt tcttctttaa gttctagttg taaaaactct 
33721 ctcatattat caccaaactg cttagccaga agccccccgg gaacaagagc aggggacgct 
33781 acagtgcagt acaagcgcag acctccccaa ttggctccag caaaaacaag attggaataa 
33841 gcatattggg aaccaccagt aatatcatcg aagttgctgg aaatataatc aggcagagtt 
33901 tcttgtagaa attgaataaa agaaaaattt gccaaaaaaa cattcaaaac ctctgggatg 
33961 caaatgcaat aggttaccgc gctgcgctcc aacattgtta gttttgaatt agtctgcaaa 
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34021 aataaaaaaa aaacaagcgt ■ catatcatag tagcctgacg aacaggtgga taaatcagtc 
Itoll tlll^ce.c aagacalglc acagggtctc cagotcgacc ^tcgtaaaac ctgtcatcgt 
34141 gattaaacaa cagcaccgaa agttcctcgc ggtgaccagc atgaataagt cttgatgaag 
34201 catacaatcc agacatgtta gcatcagtta aggagaaaaa acagccaaca tagcctttgg 
Itlll SaSattat gctSatcgt lagtatlgca aagccacccc tcgoggatac aaagtaaaag 
34321 gcacaggaga ataaaaaata taattatttc tctgctgctg tttaggcaac gtcgcccccg 
34381 Itc^ctltaa atacacatac aaagcctcat cagccatggc ttaccagaga aagtacagcg 
34441 ggcacacaaa ccacaagctc taaagtcact ctccaacctc tccacaatat atatacacaa 
34501 accctaa'act gacgtaatgg gactaaagtg taaaaaatcc cgccaaaccc aacacacacc 
34561 ccqaaactgc g?cIccagS laaagtacag tttcacttcc gcaatcccaa caagcgtcac 
llltl ?tcc?ct«c tcacggtlcg tcacatccca ttaacttaca acgtcatttt cccacggccg 
34681 cgccgcccct tttaaocgtt aaccccacag ccaatcacca cacggcccac actttttaaa 
34741 atcacctcat ttacatattg gcaccattcc atctataagg tatattattg atgatg 
SEQ ID NO: 1 
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1 ccattgcata cgttgtatcc atatcataat atgtacattt atattqcrctc atgtccaaca 
61 ttaccqccat gttgacattg attattgact agttattaat agtaatcaat tacggqgtca 
121 ttagttcata gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct 
IBl ggctgaccgc ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta 
241 acgccaatag ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac 
♦ 301 ttggcagtac atcaagtgta tcatatgcca agtacgcccc ctattgacgt caatgacggt 
361 aaatggccccy cctggcatta tgcccagtac atgaccttat gggactttcc tacttggcag 
421 tacatctacg tattagtcat cgctattacc atggtgatgc ggttttggca gtacatcaat 
481 gggcgtggat agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat 
541 gggagtttgt tttggcacca aaatceiacgg gactttccaa aatgtcgtaa caactccgcc 
601 ccattgacgc aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcgt 
661 ttaqtgaacc gtcagatcgc ctggagacgc catccacgct gttttgacct ccatagaaga 
721 caccgggacc gatccagcct ccgcggccgg gaacggtgca ttggaacgcg gattccccgt 
781 gccaagagtg a gatctaccA TGGGTGCTAG GGCTTCTGTG CTGTCTGGTG GTGAGCTGGA 
841 CAAGTGGGAG AAGATCAGGC TGAGGCCTGG TGGCAAGAAG AAGTACAAGC TAAAGCACAT 
901 TGTGTGGGCC TCCAGGGAGC TGGAGAGGTT TGCTGTGAAC CCTGGCCTGC TGGAGACCTC 
961 TGAGGGGTGC AGGCAGATCC TGGGCCAGCT CCAGCCCTCC CTGCAAACAG GCTCTGAGGA 
1021 GCTGAGGTCC CTGTACAACA CAGTGGCTAC CCTGTACTGT GTGCACCAGA AGATTGATGT 
1081 GAAGGACACC AAGGAGGCCC TGGAGAAGAT TGAGGAGGAG CAGAACAAGT CCAAGAAGAA 
1141 GGCCCAGCAG GCTGCTGCTG GCACAGGCAA CTCCAGCCAG GTGTCCCAGA ACTACCCCAT 
1201 TGTGCAGAAC CTCCAGGGCC AGATGGTGCA CCAGGCCATC TCCCCCCGGA CCCTGAATGC 
1261 CTGGGTGAAG GTGGTGGAGG AGAAGGCCTT CTCCCCTGAG GTGATCCCCA TGTTCTCTGC 
1321 CCTGTCTGAG GGTGCCACCC CCCAGGACCT GAACACCATG CTGAACACAG TGGGGGGCCA 
1381 TCAGGCTGCC ATGCAGATGC TGAAGGAGAC CATCAATGAG GAGGCTGCTG AGTGGGACAG 
1441 GCTGCATCCT GTGCACGCTG GCCCCATTGC CCCCGGCCAG ATGAGGGAGC CCAGGGGCTC 
1501 TGACATTGCT GGCACCACCT CCACCCTCCA GGAGCAGATT GGCTGGATGA CCAACAACCC 
1561 CCCCATCCCT GTGGGGGAAA TCTACAAGAG GTGGATCATC CTGGGCCTGA ACAAGATTGT 
1621 GAGGATGTAC TCCCCCACCT CCATCCTGGA CATCAGGCAG GGCCCCAAGG AGCCCTTCAG 
1681 GGACTATGTG GACAGGTTCT ACAAGACCCT GAGGGCTGAG CAGGCCTCCC AGGAGGTGAA 
1741 GAACTGGATG ACAGAGACCC TGCTGGTGCA GAATGCCAAC CCTGACTGCA AGACCATCCT 
1801 GAAGGCCCTG GGCCCTGCTG CCACCCTGGA GGAGATGATG ACAGCCTGCC AGGGGGTGGG 
1861 GGGCCCTGGT CACAAGGCCA GGGTGCTGGC TGAGGCCATG TCCCAGGTGA CCAACTCCGC 
1921 CACCATCATG ATGCAGAGGG GCAACTTCAG GAACCAGAGG AAGACAGTGA AGTGCTTCAA 
1981 CTGTGGCAAG GTGGGCCACA TTGCCAAGAA CTGTAGGGCC CCCAGGAAGA AGGGCTGCTG 
2041 GAAGTGTGGC AAGGAGGGCC ACCAGATGAA GGACTGCAAT GAGAGGCAGG CCAACTTCCT 
2101 GGGCAAAATC TGGCCCTCCC ACAAGGGCAG GCCTGGCAAC TTCCTCCAGT CCAGGCCTGA 
2161 GCCCACAGCC CCTCCCGAGG AGTCCTTCAG GTTTGGGGAG GAGAAGACCA CCCCCAGCCA 
2221 GAAGCAGGAG CCCATTGACA AGGAGCTGTA CCCCCTGGCC TCCCTGAGGT CCCTGTTTGG 
2281 CAACGACCCC TCCTCCCAGT AAaataaagc ccgggcagat ctgatctgc t gtgccttcta 
2341 gttgccagcc atctgttqtt tgcccctccc ccgtgccttc cttgaccctg gaaggtgcca 
2401 ctcccactgt cctttcctaa taaaatgagg aaattgcatc gcattgtctg agtaggtgtc 
2461 attctattct ggggggtggg gtggggcagc acagcaaggg ggaggattgg gaagacaata 
2521 gcaggcatgc tggggatgcg gtgggctcta 
SEQ ID NO: 2 
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1 ccattgcata cqttgtatcc atatcataat atgtacattt atattggctc atgtccaaca 
61 ttaccgccat gttgacattg attattgact agttattaat agtaatcaat tacggggtca 
121 ttagttcata gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct 
181 ggctgaccgc ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta 
241 acgccaatag ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac 
301 ttggcagtac atcaagtgta tcatatgcca agtacgcccc ctattgacgt caatgacggt 
' 361 aaatqqcccg cctggcatta tgcccagtac atgaccttat gggactttcc tacttggcag 
421 -tacatctacg tattagtcat cgctattacc-atggtgatgc^ggttttggca gtacatcaat .._ 
481 gggcgtggat agcqgtttga ctcacgggga ttfcccaagtc tccaccccat tgacgtcaat 
541 gggagtttgt tttggcacca aaatcaacgg gactttccaa aatgtcqtaa caactccgcc 
601 ccattgacgc aaatgggcgg taggcgtgta cggtgggagg tctatataag ca gagctcgt 
661 ttagtgaacc gtcagatcgc ctggagacgc catccacgct gttttgacct ccatagaaga 
721 caccgggacc gatccagcct ccgcggccgg gaacggtgca ttggaacgcg ga ttccccgt 
781 gccaagagtg a gatcgatct aagtaagctt CCTGCATGCT GCTGCTGCTG CTGCTGCTGG 
841 GCCTGAGGCT ACAGCTCTCC CTGGGCATCA TCCCAGTTGA GGAGGAGAAC CCGGACTTCT 
901 GGAACCGCGA GGCAGCCGAG GCCCTGGGTG CCGCCAAGAA GCTGCAGCCT GCACAGACAG 
961 CCGCCAAGAA CCTCATCATC TTCCTGGGCG ATGGGATGGG GGTGTCTACG GTGACAGCTG 
1021 CCAGGATCCT AAAAGGGCAG AAGAAGGACA AACTGGGGCC TGAGATACCC CTGGCCATGG 
1081 ACCGCTTCCC ATATGTGGCT CTGTCCAAGA CATACAATGT AGACAAACAT GTGCCAGACA 
1141 GTGGAGCCAC AGCCACGGCC TACCTGTGCG . GGGTCAAGGG CAACTTCCAG ACCATTGGCT 
1201 TGAGTGCAGC CGCCCGCTTT AACCAGTGCA ACACGACACG CGGCAACGAG GTCATCTCCG 
1261 TGATGAATCG GGCCAAGAAA GCA6GGAA6T CAGTGGGAGT GGTAACCACC ACACGAGTGC 
1321 AGCACGCCTC GCCAGCCGGC ACCTACGCCC ACACGGTGAA CCGCAACTGG TACTCGGACG 
1381 CCGACGTGCC TGCCTCCGCC CGCCAGGAGS GGTGCCAGGA CATCGCTACG CAGCTCATCT 
1441 CCAACATGGA CATTGACGTG ATCCTAGGTG GAGGCCGAAA GTACATGTTT CGCATGGGAA 
1501 CCCCA6ACCC TGAGTACCCA GATGACTACA GCCAAGGTGG GACCAGGCTG GACGGGAAGA 
1561 ATCTGGTGCA GGAATGGCTG GCGAAGCGCC AGGGT6CCCG GTATGTGTGG AACCGCACTG 
1621 AGCTCATGCA GGCTTCCCT6 GACCCGTCTG TGACCCATCT CATGGGTCTC TTTGAGCCTG 
1681 GAGACATGAA ATACGAGATC CACCGAGACT CCACACTGGA CCCCTCCCTG ATGGA6ATGA 
1741 CAGAGGCTGC CCTGCGCCTG CTGAGCAGGA ACCCCCGCGG CTTCTTCCTC TTCGTGGAGG 
1801 GTGGTCGCAT CGACCATGGT CATCATGAAA GCAGGGCTTA CCGGGCACTG ACTGAGACGA 
1861 TCATGTTCGA CGACGCCATT GAGAGGGCGG GCCAGCTCAC CAGCGAGGAG GACACGCTGA 
1921 GCCTCGTCAC TGCCGACCAC TCCCACGTCT TCTCCTTCG6 AGGCTACCCC CTGCGAGGGA 
1981 GCTCCATCTT CGGGCTGGCC CCTGGCAAGG CCCGGGACAG GAAGGCCTAC ACGGTCCTCC 
2041 TATACGGAAA CGGTCCAGGC TATGTGCTCA AGGACGGCGC CCGGCCGGAT GTTACCGAGA 
2101 GCGAGAGCGG GAGCCCCGAG TATCGGCAGC AGTCAGCAGT GCCCCTGGAC GAAGAGACCC 
2161 ACGCAGGCGA GGACGTGGCG GTGTTCGCGC GCGGCCCGCA GGCGCACCTG GTTCACGGCG 
2221 TGCAGGAGCA GACCTTCATA GCGCACGTCA TGGCCTTCGC CGCCTGCCTG GAGCCCTACA 
2281 CCGCCTGCGA CCTGGCGCCC CCCGCCGGCA CCACCGACGC CGCGCACCCG GGTTAAcccg 
2341 tggtccccgc gttgcttcct ctgctggccg ggacatcagg tggcccccgc tgaattggaa 
2401 tcgatcagaa ttgatctgat ctgc tgtgcc ttctagttgc cagccatctg ttgtttgccc 
2461 ctcccccgtg ccttccttga ccctggaagg tgccactccc actgtccttt cctaataaaa 
2521 tgaggaaatt gcatcgcatt gtctgagtag gtgtcattct attctggggg gtggggtggg 
2581 gcagcacagc aagggggagg attgggaaga caatagcaqg catgctgggg atgcggtggg 
2641 ctcta 
SEQ ID NO: 3 
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1 catcatcaat aatatacccc acaaagtaaa 
* 61 tttagggcgg ggccagcgct gattggacga 
121 acggctaacg gtcgccgcgg aggcgtggcc 
181 cgtataaaaa agcggacttt agacccggaa 
241 tatgaggtaa ttctgggcgg atgcaagtaa 
301 gaggaagtga aaagtgaaaa ataccggtcc 
361 agagactttg accgattacg tgggggtttc 
421 tccgtgtcaa agtccggtgt ttatgtcaca 
481 gtcgagcccg tcaagaggcc actcttgagt 
541 ctcccagagt ctgagaaaaa tgagacacct 
601 catggccgca ttattgctgg aggattatgt 
661 atctccattt gagctgggac ctacacttca 
721 tgatgacgac ccgaacgaag aggctgtgaa 
781 ggctgacata gccagcgaag ctgtacctac 
841 acctgaattg gaagaggagg acgagctaga 
901 cagcgattca gaggacgaac agggtgagca 
961 tgtggttgtg gaagagcatt ttgtgttgga 
1021 atcctgccag taccaccggg ataagaccgg 
1081 catgaaaaag aacttcagct ttatttacag 
1141 gcttaacaca taactgggta atgcttaaac 
1201 ctaggtccgg tgtcagagga tgagtcatca 
12 61 gagctgtcag gcgaaacgcc cctgcaagtg 
1321 gagaggcgag cagctgttga aaaaattgag 
1381 cctttggacc tgagcttgaa acgccccagg 
1441 aaataaagtt gtacaataaa agtatatgtg 
1501 ggcgtggctt agtcctatat aagtggcaac 
1561 agttcctgat ggatgtgtgg actatccttg 
1621 aggatagttc agacgggtgc tccgggttct 
1681 gtctggtgta cacagttaag aaggattata 
1741 gctctggcct gctagattct ctaaatctcg 
iBOl tccacagcct tgatttttca agcccagggc 
1861 ttctggttga caaatggagc cagaacaccc 
1921 cagccatgca cctgtggagg gcatgggtga 
1981 ggcttataca gccagcagct ccgggtcttc 
2041 aggaagaaat gaggcaggcc atggacgaga 
2101 aagaggagct ggattgaatc aggtatccag 
2161 atccatggcc aggggagtga agagggagag 
2221 cgagctgacg gccagcctga tgaatcgcaa 
2281 acagatggag tgtagggatg aggtgggcct 
2341 eiaaaacccac tggttgaacc cagatgagga 
2401 gatagccctg cgcccagatt gcaagtacag 
2461 ctgctacatc tcggggaacg gggcagaggt 
2521 caggtgttgc atgatgggaa tgagagccgg 
2581 gaacatgaag ttcaatggag agaagtttaa 
2641 gaccctgcac ggctgcagtt tcttcggctt 
2701 tgctaagatc aggggatgta agttttatgg 
2761 gagcgagatg tctgtgaagc agtgtgtgtt 
2821 gggcaatgct agagtgagac attgctcttc 
2881 gggcacagcc tctctgaagc ataatatggt 
2941 catgctgaca tgcgactcgg gggtctgcca 
3001 cccccggaag aagtggccag tgtttgagaa 
3061 gggcgccaga aggggcacct tccagccgta 
3121 gctggagaac gatgccttct ccagggtgaa 
3181 ggtgtacaag atcctgagat acgatgagac 
3241 gggcagacac accaggatgc aaccagtggc 
3301 ccacctggtg atggcttgta ccgggaccga 
3361 gaggtaggtt gagtattagt gggcgtggct 
3421 gggtcttttt gcttttctgc agacatcatg 
3481 ctttttagcc cttatttgac aacccgcctg 
3541 atgggatcga cggtggacgg gcgtccagtg 
3601 gcgaccgtgg ggaactcgtc gctcgacagc 



caaaagttaa catgcaaatg agcttttgaa 
gagaagatga tgcaaatgac gtcacgacgc 
tagcccggaa gcaagtcgcg gggctgatga 
acggccgatt ttcccgcggc cacgcccgga 
aattaggtca ttttggcgcg aaaactgaat 
cgcccagggc ggaatattta ccgagggccg 
gattgcggtg ttttttcgcg aatttccgcg 
gatcagctga tccacagggt atttaaacca 
gccagcgagt agagatttct ctgagctccg 
gcgcctcctt tcttcaactg tgcctattga 
gagtacaata ttggaggacg aactgcatcc 
ggacctatat gatttggagg tagatgccca 
tttaatattt ccagaatctc tgattcttca 
accacttcat acaccgactc tgtcacccat 
cctccgatgt tatgaggaag gttttcctcc 
gagcatggct ctaatctcaa aatatgcttg 
caatcctgag gtgcccgggc aaggctgtag 
agacacgaac gcctcctgcg ctctgtgtta 
taagtggagt gaatgtgaga gagactgagt 
agctgtgcta agtgtggttt atttttgttt 
ccctcagaag aagaccaccc gtgtccccct 
cacagaccca ccccagtcag acccagtggc 
gacttgttac atgacatggg tggggatgaa 
aactaggctc agctgtgctt agtcatgtgt 
acgcatgcaa ggtgtggttt atgactcatg 
acctgggcac tggggcacag accttcaggg 
cagactttag caagacacgc cggcttgtag 
ggagacactg gtttggaact cctctatctc 
acgaggaatt tgaaaatctt tttgctgatt 
gccaccagtc ccttttccag gaaagggtac 
gcactacagc cggggttgct tttgtggttt 
aactgagcag gggctacatt ctggacttcg 
ggcagcgggg acagagaatc ttgaactact 
ttcgtctaca cagacaaaca tccatgttgg 
acccgaggag cggcctggac cctccgtcgg 
cctgtaccca gagcttagca gggtgctgac 
gagcgatggg ggcaataccg ggatgatgac 
gcgtccagag cgcattacct ggcacgagct 
gatgcaggat aaatatggcc tggagcagat 
ttgggaggag gccattaaga aatatgccaa 
ggtgaccaag acggtgaata tcagacatgc 
ggtcatcgat accctggaca aggccgcctt 
agtgatgaat atgaattcca tgattttcat 
tggggtgatg ttcatggcca acagtcacat 
caacaatatg tgcgcagagg tctggggcgc 
ctgctggatg ggcgtggtcg gaagacccaa 
tgagaaatgc tacctgggag tctctaccga 
cctggagacg ggctgcttct gcctggtgaa 
gaagggctgc acggatgagc gcatgtacaa 
tatcctgaag aacatccatg tgacctccca 
taacctactg atcaagtgcc acatgcacct 
ccagtgcaac tttagccaga ccaagctgct 
cctgaacggc atctttgaca tggatgtctc 
caagtccaigg gtgcgcgctt gcgagtgcgg 
cctggatgtg accgaggagc tgaggcccga 
gttcagctcc agtggggagg acacagatta 
aaggtgacta taaaggcggg tgtcttacga 
aacgggactg gcggggcctt cgaagggggg 
ccgggatggg ccggagttcg tcagaatgtg 
cttccagcaa attcctcgac catgacctac. 
accgccgcag ccgcggcagc cgcagccgcc 
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3661 atgacagcga cgagactggc ttcgagctac 
3721 cccagttcca tcatcgccga ggagaaactg 
3781 agccgccagc tggccgccct gacccagcag 
3841 caaaataaat gattcaataa acacagattc 
3901 tattttttcg cgcgcggtag gccctggtcc 
3961 ttttttccag gacccggtag aggtgggatt 
4021 cccgggggtg gaggtagcac cactgcatgg 
4081 tccagtcata gcaggggcgc tgggcgtggt 
4141 tggccacggg gagccccttg gtgtaggtgt 
4201 tgcgggggga gatgatgtgg agtttggcct 
4261 gatcccgcct ggggttcatg ttgtgcagga 
4321 ggaacttgtc atgcaacttg gaagggaatg 
4381 cacccaggtt ttccatgcac tcatccatga 
4441 tggcaaagac gtttctgggg tcagagacat 
4501 aagacatttt aatgaatttg gggcggaggg 
4561 gccccggggc gaagttcccc tcacatattt 
4621 ggatcatgtc cacctgcggg gcgatgaaaa 
4681 gcgaggagag caggtttctc aacagctggg 
4741 ccccgatgac gggttgcagg tggtagttca 
4801 ggggggccac ctcgttgagc atgtctctga 
4861 ggaggcggtc cccgcccagc gagagcagct 
4921 tgagcccgtc ggccatgggc atcttggcga 
4981 agagctcggt gacgtgctct acggcatctc 
5041 gggacgactg cgactgtagg gcacgagacg 
5101 cttccagggt ctcagtgtcc gcgtgagcgt 
5161 gggctgtgcg cttgcaaggg tgcgcttgag 
5221 gtcttcgccc tgcgcgtcgg cgagatagca 
5281 ggcggcgtgg cccttggcgc ggagcttgcc 
5341 gagggattgc agggcgtaga gcttgggtgc 
5401 cgctccgcag tgggcgcaga cggtctcgca 
5461 ggggtcaaaa accagttttc ccccgttctt 
5521 gagtctgtgt ccgcgctcgg tgacaaacag 
5581 gggcctgtcc tgcaggggcg tcccgcggtc 
5641 gacgaaggcg cgcgtccacg ccaagacaaa 
5701 gtccaccagg gggtccacct tttccacggt 
5761 caagaaggtg attggcttgt aggtgtaggc 
5821 ataaaagggg gcgggtctgt gctcgtcctc 
5881 cagctgttgg ggtaggtatt ccctttcgag 
5941 agtttctaga aacgaggagg atttgatgtt 
6001 actttcatcc atctggtcag aaaagactat 
6061 gccatagagg gcgttggaga gaagcttggc 
6121 gtcggctcgc tccttggccg cgatgttgag 
6181 ttcggggaag acggtggtgc gctcgtcggg 
6241 cagggtgacc agatccacgc tggtggccac 
6301 gaggcgtccg cccttgcgcg agcagaacgg 
6361 ggggtccgca tcgatggtga agatgcccgg 
6421 tgaggatgca tcatccaagg ccatctgcca 
6481 gttgaggggc ggaccccagg gcatgggatg 
6541 gtcgtagaca tagatgggct ccgagaggat 
6601 gcggatgctg gcgcgcacgt agtcatacaa 
6661 gagattggtg cgctggggct gctcggcgcg 
6721 gttggaggag atggtgggcc gttggaagat 
6781 gtcgcggatg aagtgcgcgt aggagtcttg 
6841 gacgtccatg gcgcagtagt ccagcgtttc 
6-901 cttctcccat agctcgcggt tgagggcgta 
6961 cgggaatcct cgatcgtccg cacggtaaga 
7021 gtagggacag cagcccttct ccacggggag 
7081 ggtgtgcgtc agggcgaagg tatccctgac 
7141 cgagtcgtcg cagccgccgt gctcccagag 
7201 gttaggcaga gcgaaagtga cgtcattgaa 
7261 gcgggtgatg cggaaagggc ccgggacgga 



atgcccagca gcagcagtag cccctctgtg 
ctggccctgc tggccgagct ggaagccctg 
gtgtccgagc tccgcgaaca gcagcagcag 
tgattcaaac agcaaagcat ctttattatt 
acctctcccg atcattgaga gtgcggtgga 
ggatgttgag gtacatgggc atgagcccgt 
cctcgtgctc tggggtcgtg ttgtagatga 
gctggatgat gtccttgagg aggagactga 
tggcgaagcg gttgagctgg gagggatgca 
ggatcttgag gttggcgatg ttgccaccca 
ccaccagaac ggtgtagccc gtgcacttgg 
cgtgaaagaa tttggagacg cccttgtgcc 
tgatggcgat gggcccgtgg gctgcggctt 
cgtaattatg ctcctgggtg agatcatcat 
tgccagattg ggggacaatg gttccctcgg 
gcatctccca ggctttcatc tcggaggggg 
aaacggtttc cggggcgggg gtgatgagct 
acttgccgca cccggtcggg ccgtagatga 
aggacatgca gctgccgtcg tcccgg^gga 
cttggaggtt ttcccggacg agctcgccga 
cttgcaggga agcaaagttt ttcaggggct 
gggtctgcga gaggagttcg aggcggtccc 
gatccagcag acttcctcgt ttcgggggtt 
atgggcgtcc agcgctgcca gcgtcatgtc 
ggtctccgtc acggtgaagg ggtgggcccc 
actcatcctg ctggtgctga aacgggcacg 
gttgaccatg agctcgtagt tgagggcctc 
cttggaagag cgcccgcagg cgggacagag 
gagaaagacg gactcggggg cgaaagcatc 
ctcgaccagc caggtgagct cgggctgctc 
tttgatgcgc ttcttacctc gcgtctccat 
gctgtctgtg tccccgtaga cggacttgat 
ctcctcgtag agaaactcgg accactctga 
ggaggccacg tgcgaggggt agcggtcgtt 
atgcagacac atgtccccct cctccgcatc 
cacgtgaccc ggggtccccg acgggggggt 
actctcttcc gcgtcgctgt ccacgagcgc 
agcgggcatg acctcggcac tcaggttgtc 
ggcttgccct gccgcaatgc tttttaggag 
ttttttattg tcaagcttgg tggcgaagga 
gatggatctc atggtctgat ttttgtcacg 
ctggacatac tcgcgcgcga cgcacttcca 
cacgatcctg acgcgccagc cgcggttatg 
ctcgccgcgc aggggctcgt tggtccagca 
gggcagcaca tcaagcagat gctcgtcagg 
acagagttcc ttgtcaaaat aatcgatttt 
ctcgcgggcg gccagcgctc gctcgtaggg 
cgtcagggcg gaggcgtaca tgccgcagat 
gccgatgtag gtgggataac agcgcccccc 
ctcgtgcgag ggggccaaga aggcggggcc 
gaagacgatc tggcgaaaga tggcatgcga 
gttaaagtgg gcatgaggca gacgaaccga 
cagcttggcg acgagctcgg cggtgacgag 
gcggatgatg tcataacccg cctctccttt 
ctcctcgtca tccttccagt actcccggag 
gcccagcatg tagaaatggt tcacggcctt 
ggcgtaagct tgagcggcct tgcggagcga 
catgactttc aagaactggt acttgaaatc 
ctcgaaatcg gtgcgcttct tcgagagggg 
gagaatcttg cctgcccgcg gcatgaaatt 
ggctcggttg ttgatgacct gggcggcgag 
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7321 gacgatctcg tcgaagccgt tgatgttgtg cccgacgatg tagagttcca tgaatcgcgg 
7381 gcggccttta atgtgcggca gctttttgag ctcctcgtag gtgaggtcct cggggcaatg 
7441 cagtccgtgc tgctcgagcg cccactcctg gagatgtggg ttggcttgca tgaatgaagc 
7501 ccagagctcg cgggccataa gggtctggag ctcgtcgcga aagaggcgga actgctggcc 
7561 cacggccatc ttttctgggg tgacgcagta . gaaagtaagg gggtcccgct cccagcgatc 
7621 ccagcgtaag cgcacggcta gatcgcgagc gagggcgacc agctctgggt cccccgagaa 
7681 tttcataacc agcataaagg ggacgagctg cttgccgaag gaccccatcc aggtgtaggt 
7741 ttctacatcg taggtgacaa agagccgctc cgtgcgagga tgagagccga ttgggaagaa 
7801 ctggatttcc tgccaccagt tggacgagtg gctgttgatg tgatgaaagt agaaatcccg 
7861 ccggcgaacc gagcactcgt gctgatgctt gtaaaagcgt ccgcagtact cgcagcgctg 
7921 cacgggctgt acctcatcca cgagatacac agcgcgtccc . ttgaggagga acttcaggag 
7981 tggcggccct ggctggtggt tttcatgttc gcctgcgtgg gactcaccct ggggctcctc 
8041 gaggacggag aggctgacga gcccgcgcgg gagccaggtc cagatctcgg cgcggcgggg 
8101 gcggagagcg aagacgaggg cgcgcagttg ggagctgtcc atggtgtcgc ggagatccag 
8161 gtccgggggc agggttctga ggttgacctc gtagaggcgg gtgagggcgt gcttgagatg 
8221 cagatggtac ttgatctcca cgggtgagtt ggtggctgtg tccacgcatt gcatgagccc 
8281 gtagctgcgc ggggccacga ccgtgccgcg gtgcgctttt agaagcggtg tcgcggacgc 
8341 gctcccggcg gcagcggcgg ttccggcccc gcgggcaggg gcggcagagg cacgtcggcg 
8401 tggcgctcgg gcaggtcccg gtgctgcgcc ctgagagcgc tggcgtgcgc gacgacgcgg 
8461 cggttgacat cctggatctg ccgcctctgc gtgaagacca ccggccccgt gactttgaac 
8521 ctgaaagaca gttcaacaga atcaatctcg gcgtcattga cggcggcctg acgcaggatc 
8581 tcttgcacgt cgcccgagtt gtcctggtag gcgatctcgg acatgaactg ctcgatctcc 
8641 tcctcctgga gatcgccgcg gcccgcgcgc tccacggtgg cggcgaggtc attggagatg 
8701 cgacccatga gctgcgagaa ggcgcccagg ccgctctcat tccagacgcg gctgtagacc 
8761 acgtccccgt cggcgtcgcg cgcgcgcatg accacctgcg cgaggttgag ctccacgtgc 
8821 cgcgtgaaga cggcgtagtt gcgcaggcgc tggaagaggt agtttagggt ggtggcgatg 
8881 tgctcggtga cgaagaagta catgatccag cggcgcaggg gcatctcgct gatgtcgccg 
8941 atggcctcca gcctttccat ggcctcgtag aaatccacag cgaagttgaa aaactgggcg 
9001 ttgcgggccg agaccgtgag ctcgtcctcc aggagcctga tgagttcggc gatggtggcg 
9061 cgcacctcgc gctcgaaatc cccgggggcc tcctcctctt cctcttcttc catgacgacc 
9121 tcttcttcta tttcttcctc tgggggcggt ggtggtggcg gggcccgacg acgacggcga 
9181 cgcaccggga gacggtcgac gaagcgctcg atcatctccc cgcggcggcg acgcatggtt 
9241 tcggtgacgg cgcgaccccg ttcgcgagga cgcagcgtga agacgccgcc ggtcatctcc 
9301 cggtaatggg gcgggtcccc gttgggcagc gagagggcgc tgacgatgca tcttatcaat 
9361 tgcggtgtag gggacgtgag cgcgtcgaga tcgaccggat cggagaatct ttcgaggaaa 
9421 gcgtctagcc aatcgcagtc gcaaggtaag ctcaaacacg tagcagccct gtggacgctg 
9481 ttagaattgc ggttgctgat gatgtaattg aagtaggcgt ttttaaggcg gcggatggtg 
9541 gcgaggagga ccaggtcctt gggtcccgct tgctggatgc gaagccgctc ggccatgccc 
9601 caggcctggc cctgacaccg gctcaggttc ttgtagtagt catgcatgag cctctcaatg 
9661 tcatcactgg cggaggcgga gtcttccatg cgggtgaccc cgacgcccct gagcggctgc 
9721 acgagcgcca ggtcggcgac gacgcgctcg gcgaggatgg cctgttgcac gcgggtgagg 
9781 gtgtcctgga agtcgtccat gtcgacgaag cggtggtagg ccccggtgtt gatggtgtag 
9841 gtgcagttgg ccatgagcga ccagttgacg gtctgcaggc cgggttgcac gacctctgag 
9901 tacctgagcc gcgagaaggc gcgcgagtcg aagacatagt cgttgcaggt gcgcacgagg 
9961 tactggtatc caactaggaa gtgcggcggc ggctggcggt agagcggcca gcgctgggtg 
10021 gccggcgcgc ccggggccag gtcctcgagc atgaggcggt ggtagccgta gaggtagcgg 
10081 gacatccagg tgatgccggc ggcggtggtg gaggcgcgcg ggaactcgcg gacgcggttc 
10141 cagatgttgc gcagcggcag gaaatagtcc atggtcggca cggtctggcc ggtgagacgc 
10201 gcgcagtcat tgacgctcta gaggcaaaaa cgaaagcggt tgagcgggct cttcctccgt 
10261 agcctggcgg aacgcaaacg ggttaggccg cgtgtgtacc ccggttcgag tcccctcgaa 
10321 tcaggctgga gccgcgacta acgtggtatt ggcactcccg tctcgacccg agcccgatag 
10381 ccgccaggat acggcggaga gccctttttg ccgaccgagg ggagtcgcta gacttgaaag 
10441 cggccgaaaa ccccgccggg tagtggctcg cgcccgtagt ctggagaagc tttgccaggg 
10501 ttgagtcgcg gcagaacccg gttcgcggac ggccgcggcg agcgggractt ggtcaccccg 
10561 ccgatttaaa gacccacagc cagccgactt ctccagttac gggagcgagc cccctttttt 
10621 ctttttgcca gatgcatccc gtcctgcgcc aaatgcgtcc caccccccct ccggcgacca 
10681 ccgcgaccgc ggccgtagca ggcgccggcg ctgtagcccc gccacagcag acagagatgg 
10741 acttggaaga gggcgaaggg ctggcgagac tgggggcgcc gtccccggag cgacaccccc 
10801 gcgtgcagct gcagaaggac gtgcgcccgg cgtacgtgcc tgcgcagaac ctgttcaggg 
10861 accgcagcgg ggaggagccc gaggagatgc gcgactgccg ttttcgggcg ggcagggagc 
10921 tgcgcgaggg cctggaccgc cagcgcgtgc tgcgcgacga ggatttcgag ccgaacgagc 
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10981 agacggggat cagccccgcg cgcgcgcacg tggcggcggc caacctggtg acggcctacg 
11041 agcagacggt gaagcaggag cgcaacttcc aaaagagttt caacaaccat gtgcgcacgc 
11101 taatcgcgcg cgaggaggtg gccctgggct tgatgcacct gtgggacctg gcggaggcca 
11161 tcgtgcagaa cccggacagc aagcctctga cggcgcagct gttcctggtg gtgcagcaca 
11221 gcagggacaa cgaggcgttc agggaggcgc tgctaaacat cgccgagccc gagggccgct 
11281 ggctgctgga gctgatcaac atcttgcaga gcatcgtagt gcaggagcgc agcctgagcc 
11341 tggccgagaa ggtggcggct atcaactact cggtgctgag cctgggcaag ttttacgcgc 
11*401 gcaagattta caagacgccg tacgtgccca tagacaagga ggtgaagata gacagctttt 
11461 acatgcgcat ggcgctcaag gtgctgacgc tgagcgacga cctgggcgtg taccgcaacg 
11521 accgcatcca caaggccgtg agcgcgagcc ggcggcgcga gctgagcgac cgcgagctga 
11581 tgctgagtct gcgccgggcg ctggtagggg gcgccgccgg cggtgaggag tcctacttcg 
11641 acatgggggc ggacctgcat tggcagccga gccggcgcgc cttggaggcc gcctacggtc 
11701 cagaggactt ggatgaggat gaggaagagg aggaggatgc acccgctgcg gggtactgac 
11761 gcctccgtga tgtgttttta gatgcagcaa gccccggacc ccgccataag ggcggcgctg 
11821 caaagccagc cgtccggtct agcatcggac gactgggagg ccgcgatgca acgcatcatg 
11881 gccctgacga cccgcaaccc cgagtccttt agacaacagc cgcaggccaa cagactctcg 
11941 gccattctgg aggcggtggt cccctctcgg accaacccca cgcacgagaa ggtgctggcg 
12001 atcgtgaacg cgctggcgga gaacaaggcc atccgtcccg acgaggccgg gctggtgtac 
12061 aacgccctgc tggagcgcgt gggccgctac aacagcacaa acgtgcagtc caacctggac 
12121 cggctggtga cggacgtgcg cgaggccgtg gcgcagcgcg agcggttcaa gaacgagggc 
12181 ctgggctcgt tggtggcgct gaacgccttc ctggcgacgc agccggcgaa cgtgccgcgc 
12241 gggcaggacg attacaccaa ctttatcagc gcgctgcggc tgatggtgac cgaggtgccc 
12301 cagagcgagg tgtaccagtc gggcccagac tactttttcc agacgagccg gcagggcttg 
12361 cagacggtga acctaagcca ggctttcaag aatctgcgcg ggctgtgggg cgtgcaggcg 
12421 cccgtgggcg accggtcgac ggtgagcagc ttgctaacgc ccaactcgcg gctgctgctg 
12481 ctgctgatcg cgcccttcac cgacagcggc agcgtgaacc gcaactcgta cctgggccac 
12541 Gtgctgacgc tttaccgcga ggccataggc caggcgcagg tggacgagca gaccttccag 
12601 gagatcacta gcgtgagccg cgcgctgggt cagaacgaca ccgacagtct gagagccacc 
12661 ctgaacttct tgctgacaaa tagacagcag aagattccgg cgcagtacgc gctgtcggcc 
12721 gaggaggagc gcatcctgag atatgtgcag cagagcgtag ggcttttcct gatgcaggag 
12781 ggggccaccc ccagcgccgc gctggacatg accgcgcgca acatggaacc tagcatgtac 
12841 gccgccaacc ggccgttcat caataagctg atggactacc tgcaccgcgc ggctgccatg 
12901 aactcggact actttactaa tgctatacta aacccgcact ggctcccgcc gccggggttc 
12961 tacacgggcg agtacgacat gcccgacccc eiacgatgggt tcctgtggga cgacgtggac 
13021 agcgcggtgt tctccccgac cttgcaaaag cgccaggagg cggtacgcac gcccgcgagc 
13081 gagggcgcgg tgggtcggag cccctttcct agcttaggga gtttgcatag cttgccgggc 
13141 tcggtgaaca gcggcagggt gagccggccg cgcttgctgg gcgaggacga gtacctgaac 
13201 gactcgctgc tgcagccgcc gcgggtcaag aacgccatgg ccaataacgg gatagagagt 
13261 ctggtggaca aactgaaccg ctggaagacc tacgctcagg accataggga tgcgcccgcg 
13321 ccgcggcgac agcgccacga ccggcagcgg ggcctggtgt gggacgacga ggactcggcc 
13381 gacgatagca gcgtgttgga cttgggcggg agcggtgggg ccaacccgtt cgcgcatctg 
13441 cagcccagac tggggcgacg gatgttttga atgaaataaa actcaccaag gccatagcgt 
13501 gcgttctctt ccttgttaga gatgaggcgc gcggtggtgt cttcctctcc tcctccctcg 
13561 tacgagagcg tgatggcgca ggcaaccct'g gaggttccgt ttgtgcctcc gcggtatatg 
13621 gctcctacgg agggcagaaa cagcattcgt tactcggaac tggctccgca gtacgacacc 
13681 actcgcgtgt acttggtgga caacaagtcg gcggacatcg cttccctgaa ctaccaaaac 
13741 gaccacagca acttcctgac cacggtggtg cagaacaacg atttcacccc cgccgaggcc 
13801 agcacgcaga cgataaattt tgacgagcgg tcgcggtggg gcggtgattt gaagaccatt 
13861 ctgcacacca acatgcccaa tgtgaacgag tacatgttca ccagcaagtt taaggcgcgg 
13921 gtgatggtgg ctaggaaggt ggtagatcag aatgatagga gcaaggatga gttaaaatat 
13981 gagtggtttg agtttaccct gcccgagggc aacttttccg agaccatgac catagacctg 
14041 atgaacaacg ccatcttgga aaactacttg caagtggggc ggcaaaatgg cgtgctggag 
14101 agcgatatcg gagtcaagtt tgacagcagg aatttcaagc tgggctggga cccggtaacc 
14161 aagctggtga tgcctggggt ctacacctac gaggccttcc acccggacgt tgtgctgctg 
14221 ccgggctgcg gggtggactt caccgagagc cgcctgagca acctcctggg cattcgcaag 
14281 aagcaacctt tccaagaggg cttcaggatc atgtatgagg atctcgaggg tggtaacatc 
14341 cccgccctcc tggatgtcaa gcaatatttg gatagtaaaa agaagcttga ggaggcaaca 
14401 cagaatgcaa ccagggctgc tggagatatc agaggagaca gtcatattcc aagagctgtg 
14461 gaacaagcgg ctgaaaagga tctggtcatt gtaccagtaa cacaagatga aagtaagaga 
14521 agctataatg tcatagatgg cacccatgac accctctacc gaagttggta cctgtcctat 
14581 acctacgggg accccgagaa gggggtgcag tcgtggacgc tgctcaccac cccggacgtc 
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14641 acctgcggcg cggagcaagt ctactggtcg ctgccggacc tcatgcaaga ccccgtcacc 
14701 ttccgctcta cccagcaagt cagcaactac cccgtggttg gcgccgagct catgcccttc 
14761 cgcgccaaga gcttttacaa cgacctcgcc gtctactccc agctcatccg cagctacacc 
14821 tccctcaccc acgtcttcaa ccgcttcccc gacaaccaga tcctctgccg tccgcccgcg 
14881 cccaccatca ccacggtcag tgaaaacgtg cctgctctca cagatcacgg gacgctaccg 
14941 ctgcgcagca gtatccgcgg agtccagcga gtgaccgtca ctgacgcccg tcgccgcacc 
15001 tgtccctacg tctacaaggc cctgggcata gtcgcgccgc gcgtgctttc cagtcgcacc 
15061 ttctaaaaaa tgtctattct catctcgccc agcaataaca ccggctgggg tcttactagg 
15121 cccagcacca tgtacggagg agccaagaag cgctcccagc agcaccccgt ccgcgtccgc 
15181 ggccacttcc gcgctccctg gggcgcttac aagcgcgggc ggacttctac cgccgccgtg 
15241 cgcaccaccg tcgacgacgt catcgactcg gtggtcgccg acgcgcgcaa ctataccccc 
153 01 gccccctcca ccgtggacgc ggtcatcgac agcgtggtgg ccgacgcgcg cgactatgcc 
15361 agacgcaaga gccggcggcg acggatcgcc aggcgccacc ggagtacgcc cgccatgcgc 
15421 gccgcccggg ctctgctgcg ccgcgccaga cgcacgggcc gccgggccat gatgcgagcc 
15481 gcgcgccgcg ccgccactgc accccccgca ggcaggactc gcagacgagc ggccgccgcc 
15541 gctgccgcgg ccatttctag catgaccaga cccaggcgcg gaaacgtgta ctgggtgcgc 
15601 gactccgtca cgggcgtgcg cgtgcccgtg cgcacccgtc ctcctcgtcc ctgatctaat 
15661 gcttgtgtcc tcccccgcaa gcgacgatgt caaagcgcaa aatcaaggag gagatgctcc 
15721 aggtcgtcgc cccggagatt tacggaccac cccaggcgga ccagaaaccc cgcaaaatca 
15781 agcgggttaa aaaaaiaggat gaggtggacg agggggcagt agagtttgtg cgcgagttcg 
15841 ctccgcggcg gcgcgtaaat tggaaggggc gcagggtgca gcgcgtgttg cggcccggca 
15901 cggcggtggt gtttacgccc ggcgagcggt cctcggtcag gagcaagcgt agctatgacg 
15961 aggtgtacgg cgacgacgac atcctggacc aggcggcgga gcgggcgggc gagttcgcct 
16021 acgggaagcg gtcgcgcgaa gaggagctga tctcgttgcc gctggacgag agcaacccca 
16081 cgcctagcct gaagcccgtg accctgcagc aggtgctgcc ccaagcagtg ctgctgccga 
16141 gccgcggggt caagcgcgag ggcgagaata tgtacccgac catgcagatc atggtgccca 
16201 agcgccggcg cgtggaagaa gtgctggaca ccgtgaaaat ggatgtggag cccgaggtca 
16261 aggtgcgccc catcaagcag gtggcgccgg gcctgggcgt gcagaccgtg gacattcaga 
16321 tccccaccga catggatgtt gacaaaaaac cctcgaccag catcgaggtg cagaccgacc 
16381 cctggctccc agcctccacc gctgccgtct ccacttctac cgccgccacg gctaccgagc 
16441 ctcccagaag gcgaagatgg ggccctgcca accggctgat gcccaactac gtattgcatc 
16501 cttccattat cccgacgccg ggctatcgcg gcacccggta ctacgccagc cgcaggcgcc 
16561 cagccagcaa acgccgccgc cgcaccgcca cccgccgccg tctggccccc gcccgcgtgc 
16621 gccgcgtaac cacgcgccgg ggccgctcgc tcgttctgcc caccgtgcgc taccacccca 
16681 gcatccttta atccgtgtgc tgtgatactg ttgcagagag atggctctca cttgccgcct 
16741 gcgcatcccc gtcccgaatt accgaggaag atcccgccgc aggagaggca tggcaggcag 
16801 cggcctcaac cgccgccggc ggcgggccat gcgcaggcgc ctgagtggcg gctttctgcc 
16861 cgcgctcatc cccataatcg cggcggccat cggcacgatc ccgggcatag cttccgttgc 
16921 gctgcaggcg tcgcagcgcc gttgatgtgc gaataaagcc tctttagact ctgacacacc 
16981 tggtcctgta tatttttaga atggaagaca tcaattttgc gtccctggct ccgcggcacg 
17041 gcacgcggcc gttcatgggc acctggeiacg agatcggcac cagccagctg aacgggggcg 
17101 ccttcsiattg gagcagtgtc tggagcgggc ttaaaaattt cggctcgacg ctccggacct 
17161 atgggaacaa ggcctggaat agtagcacgg ggcagttgtt aagggaaaag ctcaaagacc 
17521 agaacttcca gcagaaggtg gtggacggcc tagcctcggg cattaacggg gtggtggaca 
17281 tagcaaacca ggccgtgcag cgcgagataa acagccgcct ggacccgcgg ccgcccacgg 
17341 tggtggagat ggaagatgca actcctccgc cgcccaaggg cgagaagcgg ccgcggcccg 
17401 acgcggagga gacgatcctg caggtggacg agccgccctc gtacgaggag gcccrtcaagg 
17461 ccggcatgcc caccacgcgt atcatcgcgc cactggccac tggtgtaatg aaacccgcca 
17521 cccttgacct gcctccgcca cccacgcccg ctccaccgaa ggcagctccg gttgtgcagc 
17581 cccctcctgt ggcgaccgcc gtgcgccgcg tccccgcccg ccgccaggcc cagaactggc 
17641 agagcacgct gcacagtatc gtgggcctgg gagtgaaaag tctgaagcgc cgccgatgct 
17701 attgagagag aggaaagagg acactaaagg gagagcttaa cttgtatgtg ccttaccgcc 
17761 agagaacgcg cgaagatggc taccccctcg atgatgccgc agtgggcgta catgcacatc 
17821 gccgggcagg acgcctcgga gtacctgagc ccgggtctgg tgcagtttgc ccgcgccacc 
17881 gacacgtact tcagcctggg caacaagttt aggaacccca cggtggctcc cacccacgat 
17941 gtgaccacgg accggtccca gcgtctgacg ctgcgctttg tgcccgtgga tcgcgaggac 
18001 accacgtact cgtacaaggc gcgcttcact ctggccgtgg gcgacaaccg ggtgctagac 
18061 atggccagca cttactttga catccgcggc gtcctggacc gcggtcccag cttcaaaccc 
18121 tactcgggca cggcttacaa cagcctggcc cccaaaggcg cccccaactc tagtcagtgg 
18181 gaacaagcta aagctaccaa tgccggtcaa aaggaaactc acacatttgg agtagccgct 
18241 atgggcggag aagacattac agtgaaaggt cttcaaattg gaactgatga aactaaggaa 
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18301 gatggagagg atgaaatttt tgcagatcaa acattccagc cagaacctca agtgggagaa 
18361 cagaactggc aagaaacgtt tgttttctat ggaggcagag ctcttaagaa agaaaccaaa 
18421 atgaagccat gttatggctc ttatgcgaga cccacaaatg aaaagggagg acaggctaaa 
18481 tttacacttg atgaaaaagg tcagccaacc aaaattcctg atattacaat ggatttcttt 
18541 gatagtccac aagatgatac atcaggtgta actaataagc cagatattgt catgtatgca 
18601 gaaaatgtaa atttagaagc tcctgacaca catgtagttt acaaaccagg caaagatgat 
18661 tctagttctt ccgctaacct cacacaacag gccatgccta acagaccgaa ctacatcggg 
18721 ttcagagaca actttgtggg tcttatgtac tacaatagta ctggcaacat gggtgtgctg 
18781 gctggtcagg cctctcagtt gaatgctgtg gtcgacttgc aagacagaaa caccgagctg 
18841 tcttaccagc tattgctaga ttctctgggt gacagaacca gatactttag catgtggaat 
18901 tctgcagtgg acagctatga ccccgatgtc aggatcattg agaatcacgg tgtggaagat 
18961 gaacttccaa actattgctt cccactgaat ggcagtggtt ctaacagcac atacaaaggt 
19021 gttaaagctg gaactggaaa caattgggat gacgatgaaa atgttgcaag acaaaatcag 
19081 attggcactg gcaacctgtt cgccatggag atcaacctcc aggccaacct atggaagagt 
19141 tttctgtact cgaacgtggc cctgtacctg cccgactcct acaagtacac gccggccaac 
19201 gtcacgctgc ccaccaacac caacacctac gactacatga acggccgcgt ggtagccccc 
19261 tcgctggtgg acgcctacat caacattggc gcccgctggt cgctggaccc catggacaat 
19321 gtcaatccct tcaaccacca ccgcaacgcg ggcctgcgct accgctccat gctcctgggc 
19381 aacggccgct acgtgccctt ccacatccaa gtgccccaaa agttctttgc catcaagaac 
19441 ctgcttctgc tccccggttc ctacacctac gagtggaact tccgcaagga cgtcsiacatg 
19501 atcctgcaga gttccctcgg caacgacctg cgcgtcgacg gcgcctccgt ccgcttcgac 
19561 agcgtcaacc tctacgccac cttcttcccc atggcgcaca acaccgcctc caccctggaa 
19621 gccatgctgc gcaacgacac caacgaccag tccttcaacg actacctctc ggccgccaac 
19681 atgctctacc ccatcccggc caaggccacc aacgtgccca tctccatccc ctcgcgcaac 
19741 tgggccgcct tccgcggctg gagtttcacc cggctcaaga ccaaggaaac tccctccctc 
19801 ggctcgggtt tcgaccccta ctttgtctac tcgggctcca tcccctacct cgacgggacc 
19861 ttctacctca accacacctt caagsiaggtc tccatcatgt tcgactcctc ggtcagctgg 
19921 cccggcaacg accggctgct cacgccgaac gagttcgaga tcaagcgcag cgttgacggg 
19981 gagggctaca acgtggccca atgcaacatg accaaggact ggttcctcgt ccagatgctc 
20041 tcccactaca acatcggcta ccagggcttc cacgtgcccg agggctacaa ggaccgcatg 
20101 tactccttct tccgcaactt ccagcccatg agcaggcagg tggtcgatga gatcaactac 
20161 aaggactaca aggccgtcac cctacccttc cagcacaaca actcgggctt caccggctac 
20221 cttgcgccca ccatgcgcca ggggcagccc taccccgcca acttccccta cccgctcatc 
20281 ggctccaccg cagttccctc cgtcacccag aaaaagttcc tctgcgacag ggtcatgtgg 
20341 cgcatcccat tctccagcaa ctttatgtcc atgggcgccc tcaccgacct gggtcagaac 
20401 atgctctatg ccaactcggc ccacgcgctc gacatgacct ttgaggtgga ccccatggat 
20461 gagcccaccc tcctctatct tctcttcgaa gttttcgacg tggtcagagt gcaccagccg 
20521 caccgcggcg tcatcgaggc cgtctacctg cgcacgccct tctccgccgg caacgctacc 
20581 acttaagcat gagcggctcc agcgaacaag agctcgcggc catcgtgcgc gacctgggat 
20641 gcgggcccta ctttttggga acccacgaca agcgcttccc tggcttcctt gccggcgaca 
2 07 01 agctggcctg cgccatcgtc aacacggccg gccgcgagac cggaggcgtg cactggctcg 
20761 cctttggctg gaatccgcgc tcgcgcacct gctacatgtt cgaccccttt gggttctcgg 
20821 accgccggct caagcagatt tacagcttcg agtacgaggc catgctgcgc cgaagcgcgc 
20881 ttgcctcctc gcccgaccgc tgtctcagcc tcgagcagtc cacccagacc gtgcaggggc 
20941 ccgactccgc cgcctgcgga cttttttgtt gcatgttttt gcatgccttc gtgcactggc 
21001 ccgaccgacc catggacgga aaccccacca tgaacttgct gacgggggtg ccaaacggca 
21061 tgctacaatc gccacaggtg ctgcccaccc tcaggcgcaa ccaggaggag ctctaccgct 
21121 tcctcgcgcg ccactcccct tactttcgat cccaccgcgc cgccatcgaa aacgccaccg 
21181 cttttgataa aatgaaacaa ctgcgtgtat ctcaataaac agcactttat tttacatgca 
21241 ctggagtata tgcaagttat ttaaaagtcg aaggggttct cgcgctcgtc gttgtgcgcc 
21301 gcgctgggga gggccacgtt gcggtactgg tacttgggaa gccacttgaa ctcggggatc 
21361 accagtttgg gcactggggt ctcggggaag gtctcgctcc acatgcgccg gctcatctgc 
21421 agggcgccca gcatgtccgg gccggagatc ttgaaatcac aattggggcc ggtgctctgc 
21481 gcgcgcgagt tgcggtacac ggggttgcag cactggaaca ccattagact ggggtacttc 
21541 acactggcaa gcacgctctt gtcgctgatc tgatccttgt ccaggtcctc ggcgttgctc 
21601 aggccgaacg gggtcatctt gcacagctgg cggcccagga agggcacgct ctgaggcttg 
21661 tggttacact cgcagtgcac gggcatcagc atcatccccg cgccgcgctg catattcggg 
21721 tagagggcct tgacgaaggc cgtgatctgc ttgaaagctt gctgggcctt agccccctcg 
21781 ctgaaaaaca ggccgcagct cttcccgcta aactggttat tcccgcaccc ggcatcatgc 
21841 acgcagcagc gcgcgtcatg gctggtcagt tgcaccacgc tacgtcccca gcggttctgg 
21901 gtcaccttgg ccttgctggg ctgctccttc aacgcgcgct gcccgttctc gctggtcaca 
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21961 tccatctcca ccacgtggtc cttgtggatc atcaccgtcc catgcagaca cttgagctga 
22021 ccctcgacat cgcagcagcc atgatcccac agggcgcagc cggtgcactc ccagttctta 
22081 tgcgcgatcc cgctgtggct gaagatgtaa ccttgcaaca ggcgacccat gacggtgcta 
22141 aatgctttct gggtggtgaa ggtcagttgc agaccgcggg cctcctcgtt catccaggtc 
22201 tggcacatct tttggaagat ctcggtctgc tcgggcatga gcttgtaagc atcgcgcagg 
222 61 ccgctgtcga cgcggtagcg ttccatcagc acgttcatgg tatccatgcc cttctcccag 
22321 gacgagacca gaggcagact cagggggttg cgcacgttca ggacaccggg ggtcgcaggc 
22381 tcgacgatgc gttttccgtc cttgccttcc ttcaacagaa ccggaggctg gctgaatccc 
22441 actcccacga ttacggcatc ttcctggggc atctcttcgt cggggtctac cttggtcaca 
22501 tgcttggtct ttctggcttg cttctttttt ggagggctgt ccacggggac cacgtcctcc 
22561 tcggaagacc cggagcccac ccgctgatac tttcggcgct tggtgggcag aggaggtggt 
22621 ggcggcgagg ggctcctctc ctgctccggc ggatagcgcg ccgacccgtg gccccggggc 
22681 ggagtggcct ctcgctccat gaaccggcgc acgtcctgac tgccgccggc cattgtttcc 
22741 taggggaaga tggaggagca gccgcgtaag caggagcagg aggaggactt aaccacccac 
22801 gagcaaccca aaatcgagca ggacctgggc ttcgaagagc cggctcgtct agaaccccca 
22861 caggatgaac aggagcacga gcaagacgca ggccaggagg agaccgacgc tgggctccag 
22921 catggctacc tgggaggaga ggaggatgtg ctgctaaaac acttgcagcg ccaatccatc 
22981 atcctccggg acgccctggc cgaccggagc gaaacccctc tcagcgtcga ggagctgtgt 
23041 cgggcctacg agctcaacct cttctcgccg cgcgtgcccc ccaaacgcca gcccaacggc 
23101 acctgcgagc ccaacccgcg tctcaacttc tatcccgtct ttgcggtccc cgaggcccta 
23161 gccacctatc acatcttttt caagaaccaa aagatccccg tctcctgccg cgccaaccgc 
23221 acccgcgccg acgcgctcct cgctctgggg cccggcgcgc gcatacctga tatcgcttcc 
23281 ctggaagagg tgcccaagat cttcgaaggg ctcggtcggg acgagacgcg cgcggcaaac 
23341 gctctgaaag aaacagcaga ggaagagggt cacactagcg ccctggtaga gttggaaggc 
23401 gacaacgcca ggctggccgt gctcaagcgc agcgtcgagc tcacccactt cgcctacccc 
23461 gccgtcaacc tcccgcccaa ggtcatgcgt cgcatcatgg atcagctcat catgccccac 
23521 atcgaggccc tcgatgaaag tcaggagcag cgccccgagg acgcccggcc cgtggtcagc 
23581 gacgagcagc tcgcgcgttg gctcgggacc cgcgaccccc aggctttgga acagcggcgc 
23641 aagctcatgc tggccgtggt cctggtcacc ctcgagctcg aatgcatgcg ccgcttcttc 
23701 agcgaccccg agaccctgcg taaggtcgag gagaccctgc actacacttt caggcacggt 
23761 ttcgtcaggc aggcctgcaa gatctccaac gtggagctga ccaacctggt ctcatgcctg 
23821 gggatcctgc acgagaaccg cctgggacag accgtgctcc actctactct gaagggcgag 
23881 gcgcgtcggg actatgtccg cgactgtgta tttctcttta tctgccacac ctggcaagca 
23941 gccatgggcg tgtggcagca gtgtctcgag gacgaaaatc tgaaggagct ggacaagctt 
24001 cttgctagaa accttaaaaa gctgtggacg ggcttcgacg agcgcaccgt cgcctcggac 
24061 ctggccgaga tcgtttttcc agaacgcctg aggcagacgc tgaaaggcgg gctgcccgac 
24121 ttcatgagcc agagcatgtt gcaaaactac cgcactttca ttctcgagcg atctgggatg 
24181 ctacccgcca cctgcaacgc attcccctcc gactttgtcc cgctgagcta ccgcgagtgt 
24241 cccccgccgc tgtggagcca ctgctatctc ttgcagctgg ccaactacat cgcctaccac 
24301 tcggacgtga tcgaggacgt gagcggcgag gggcttctcg agtgccactg ccgctgcaac 
24361 ctgtgctccc cgcaccgctc cctggtctgc aacccccagc ttctgagcga gacccaggtc 
24421 atcggtacct tcgagctgca aggtccgcag gagtccaccg ctccgctgaa actcacgccg 
24481 gggttgtgga cttccgcgta cctgcgcaaa tttgtacccg aggactacca cgcccatgaa 
24541 ataaagttct tcgaggacca atcgcgccca cagcacgcgg atctcacggc ctgcgtcatc 
24601 acccagggcg cgatcctcgc ccaattgcac gccatccaaa aatcccgcca agagtttctt 
24661 ctaaaaaagg gtagaggggt ctacctggac ccccagacgg gcgaggtgct caacccgggt 
24721 ctcccccagc atgccgagga agaagcagga gccgctagtg gagcagatgg aagaagaatg 
24781 ggacagccag gcagaggagg acgaatggga ggaggagaca gaggaggaag aattggaaga 
24841 ggtggaagag gagcaggaaa cagagcagcc cgtcgccgca ccatccgcgc cggcagcccc 
24901 gccggtcacg gatacaacct ccacagctcc ggccaagcct cctcgtagat gggatcgagt 
24961 gaagggtgac ggtaagcacg agcggcaggg ctaccgatca tggagggtcc acaaagpcgc 
25021 gatcatcgcc tgcttgcaag actgcggggg gaacatcgct ttcgcccgcc gctacctgct 
25081 cttccaccgc ggggtgaaca tcccccgcaa cgtgttgcat tactaccgtc accttcacag 
25141 ctaagaaaaa gcaagtaaga ggagtcgccg gaggaggcct gaggatcgcg gcgaacgagc 
25201 cctcgaccac cagggagctg aggaaccgga tcttccccac tctttatgcc atttttcagc 
25261 agagtcgagg tcagcagcaa gaactgaaag taaaaaaccg gtctctgcgc tcgctcaccc 
25321 gcagttgctt gtaccacaaa aacgaagatc agctgcagcg cactctcgaa gacgccgagg 
25381 ctctgttcca caagtactgc gcgctcactc ttaaagacta aggcgcgccc acccggaaaa 
25441 aaggcgggaa ttacctcatc gccaccatga gcaaggagat tcccacccct tacatgtgga 
25501 gctatcagcc ccagatgggc ctggccgcgg gcgcctccca ggactactcc acccgcatga 
25561 actggctcag tgccggcccc tcgatgatct cacgggtcaa cggggtccgt aaccatcgaa 
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25621 accagatatt gttggagcag gcggcggtca cctccacgcc cagggcaaag ctcaacccgc 
25681 gtaattggcc ctccaccctg gtgtatcagg aaatccccgg gccgactacc gtactacttc 
25741 cgcgtgacgc actggccgaa gtccgcatga ctaactcagg tgtccagctg gccggcggcg 
25801 cttcccggtg cccgctccgc ccacaatcgg gtataaaaac cctggtgatc cgaggcagag 
25861 gcacacagct caacgacgag ttggtgagct cttcgatcgg tctgcgaccg gacggagtgt 
25921 tccaactagc cggagccggg agatcgtcct tcactcccaa ccaggcctac ctgaccttgc 
25981 agagcagctc ttcggagcct cgctccggag gcatcggaac cctccagttc gtggaggagt 
26041 ttgtgccctc ggtctacttc aaccccttct cgggatcgcc aggcctctac ccggacgagt 
26101 ttataccgaa cttcgacgca gtgagagaag cggtggacgg ctacgactga atgtcccatg 
26161 gtgactcggc tgagctcgct cggttgaggc atctggacca ctgccgccgc ctgcgctgct 
26221 tcgcccggga gagctgcgga ctcatctact ttgagtttcc cgaggagcac cccaacggcc 
26281 ctgcacacgg agtgcggatc accgtagagg gcaccaccga gtctcacctg gtcaggttct 
26341 tcacccagca acccttcctg gtcgagcggg accggggagc taccacctac accgtctact 
26401 gcatctgtcc taccccgaag ttgcatgaga atttttgctg tactctttgt ggtgagttta 
26461 ataiaaagctg aactaagaac cttctttgga atcccttgtc atcatcaaat caacaagacc 
26521 atcaacttca cctttgagga acaggtgaac tttacctgca agccacacaa gaagtacatc 
26581 atctggtttt atcacaacac tactctagca gtagccaaca cctgctcgaa cgacggtgtt 
26641 ctcctaccta acaatctcac cagtgsracta accttctcag ttaaaagggc aaagctaatt 
26701 cttcatcgcc ctattgtaga aggaacttac cagtgtcaga gcggaccttg cttccacagt 
26761 ttcactttgg tgaacgttac cggcagcagc acagccgctc cagaaacatc taaccttctt 
26821 tctgatacta acaaacctcg tgtcggaggt gagctttggg ttccatctct aacagagggt 
26881 gggagttcta ttgaagtggt tgggtatttg attttagggg tggtcattgg tgggtgcata 
26941 gcagtgctgt atcaacttcc ttgctgggtc gaaatcaggg tatttatctg ctgggtcaga 
27001 cattgtgggg aggaaccatg aaggggctct tgctgattat cctttccctg gtggggggtg 
27061 tgctgtcatg ccacgaacag ccacgatgta acattaccac aggceiatgag aggaacgact 
27121 gctctgtagt tatcaaatgc gagcaccatt gtcctctcaa catcacattc aagaatp.aga 
27181 ccatgggaaa tgtatgggtg ggattctggc aaccaggaga tgagcagaac tacacggtca 
27241 ctgtccatgg tagcgatggc aatcacactt tcggtttcaa attcattttt gaagtcatgt 
27301 gtgatatcac actacatgtg gctagacttc atggcttgtg gccccctacc aaggagaaca 
27361 tggtgggttt ttctttggct tttgtgatca tggcctgctt gatgtcaggt ctgctggtag 
27421 gggctctagt gtggtttctg aaacgcaagc ccaggtacgg aaatgaggag aaggaaaaat 
27481 tgctataaat tctttttctc ttcgcacaac catgaataca gtgttccgta tcgtgctgct 
27541 ctctcttctt gtagctttcg gtcaggcagg aattcatatt attaatgcta catggtggga 
27601 taatataact ttagtgggac cctcagatac tccagttacc tggtatgatg gcaagggatt 
27661 gcaattttgt gacggaagta cagttaagaa tccgcagatc agacatactt gtaatgatca 
27721 aaacttaact ctgattcatg ttaacaaaac ccatgaaaga acatacatg^ gttacagaca 
27781 tgacagtaag ggaaaagtag actataaggt tacagtcatt ccacctcctc ctgctactgt 
27841 aaagccacaa ccagatccag aaaatgtctt tgtttatatg ggaaataatg taactttagt 
27901 tggacctcca ggaattccag ttagttggta ttatcataat ggcacacagt tctgcgatgg 
27961 agataaaatt attcatccag aattcaacca cacctgtgat aaacaaaacc ttacactgct 
28021 gtttgtaaac tttacacatg atggaggcta tcttggattc aattacaaag gtactcagag 
28081 aattcagtat gaggttatag ttttagatcg atttccaaat tctggtcaga tgaaaattga 
28141 agaacaaagt gaggaaacag aacagaaaca tactgagcat aataaggctg gacaaaagca 
28201 gggtatagat acaaatcaaa agaaagctaa taacagacaa aagccatctc aaaggccatc 
28261 aagaagacgg ccgacaaaca ctcctgagac aaaacaactt acagtgtcta ttgggtctaa 
28321 cttaacttta gttggtccag atggaaaagt cacttggtat gatggtgatt taaaaagacc 
28381 atgtgaagaa caaaactata ggcttccaca tcagtgtagt gctcagaact taactttaat 
28441 taatgtaact aaatctcatg agggaactta ctatggcact aatgacaaag acgaaagcaa 
28501 aagatacaga gtgaaagtga acactacaaa ttctcaagct gtaaaaatta acccatatac 
28561 cagacctact actcctgatc agaaacacag atttgaatta caaattgaaa ataatgcaaa 
28621 tgatgaagaa tcaaaaattc catctactac tgtggcaatc gtggtgggag tgattgcggg 
28681 cttcataact ataatcattg tcattctgtg ctacatctgc tgccgcaagc gtcccagggc 
28741 atacaatcat atggtagacc cactactcag cttctcttac tgagactcag tcactttcat 
28801 ttcagaacca tgaaggcttt cacagcttgc gttctgttta acataatcac acttagtgta 
28861 gctgcaaatg gttttaaaca tgttaatgtt accagattaa gtaatgtaac actgacagga 
28921 gctggaatta atactacatg gacagggtat tttaatgagg gtccaaaagg aaaaaatggg 
28981 tggatgaata tttgcacatg gggcgatcct agatatgtgt gccatggaaa tagcagtact 
29041 attactaatc ttacagttgt ggcacttcta aatttaacca ctaacagaag atttaaagca 
29101 gaaagtttta ctagtaacga tggttatgaa actaccagtg caaaatttta tgaaattaaa 
29161 attattgagc ttccaacaac tagagcaccc accacagtta ggacaacaca gcctaccact 
29221 gtgcccacta cacatccaac caccacagtc agtacaacta ttgagaccac tactcatact 
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29281 acacagctag acacaacagt gcagaatact actttettga ttgggttttt actgagagga 
29341 aatgaLgta ctactgaaca gacagaggct acctcaagtg ccttcagcag cactgcaaat 
29401 ?talctt?gc ttgcttggac taatgaaacc ggagtatcat t^atgaatcg acagccttac 
29461 tcaggtttgg atattcaaat tacttttctg gttgtctgtg ggatctttat tcttgcggtt 
29521 cttclgtact ttgtctgctg caaagccaga gagaaatcta ggcggcccat atacaggcca 
29581 gtaatcgggg aacctcagcc tctccaagtg gatggaggct taaggaatct tctcttctct 
29641 ?tScagtat ggtgatcagc catgattcct aggttcttcc tatttaacat cctgttctgt 
29701 ctcttcLca tctgtgctgc cttcgcggco gtctcgcacg cctcgcccga ctgtctaggg 
29761 cctttcccaa catacctcct ctttgccctg ctaacctgca cctgcgtctg cagcattgtc 
29821 tgcgtggtca tcacctttct gcagctcatc gactggtgct gcgcgcgcta caattatctc 
29881 ScLcagtc ccgaatacag ggacgagaac gtagccagaa tcttaaggct "tctgacca 
29941 tgcagcctct gctcatgctg atatccctcc tatccoctgc ccttgccact tctgctgatt 
30001 actctaaatg caaattcgcg gacatatgga atttcttaga ttgctatcag gagaaaattg 
30061 atatgccctc otattacttg gtgattgttg gggtagtcat ggtctgctca tgcactttct 
30121 ttgccattat gatctacccc tgttttaatc ttggotggaa ctctgttgag gcattcacat 
30181 acacactaga aaacagttca ctagcctcca cgccaccacc cacaccgcct ccccgcagaa 
30241 atcagttccc tatgattcag tacttagaag agccccctcc ccggccccct tccactgtta 
30301 gctactttca cataaccggc ggcgatgact gaccacctgg acctcgagat ggacggccag 
30361 gcctccgagc agcgcatcct gcaactgcgc gtccgacagc agcaggagcg ggccgccaag 
30421 gagctcctcg atgccatcaa catccaccag tgcaagaagg gcatcttctg cctggtcaag 
30481 caggcaaaga tcacctacga gctcgtgtcc ggcggcaagc agcatcgcct cgcctatgag 
30541 ctaccccagc agaagcaaaa gttcacctgc atggtgggcg tcaaccccat agtcatcacc 
30601 cagcagtcgg gcgagaccaa cggctgcatc cactgctcct gcgaaagccc cgagtgcatc 
30661 taStccctcc Laagaccct ttgcggactc cgcgacctcc tccccatgaa ctgatgttga 
30721 ttaaaagccc aaaaaccaat caaacccttc cccaattact cataagaata aatcattgga 
30781 actaatcatt caataaagat cacttacttg aaatctgaaa gtatgtctct ggtgtagttg 
30841 ttcagcagca cctcggaacc ctcctcccag ctctggtact ccagtccccg gcgggcggcg 
loToi aS?cctco acaccitgaa agggatgtca aattcctggt ccacaatttt cattgtcttc 
30961 cctcagatga caaagaggct ccgggtggaa gatgacttca accccgtcta cccctatggc 
31021 tacgcgcgga atcagaatat ccccttcctt actcccccct ttgtttcttc cgatggattc 
31081 caaaacttcc cacctggggt cctgtcactc aaactggctg acccaatcgc catcactaat 
31141 ggggatgttt cactcaaggt gggagggggt cttactgttg aaaaagatag tggaaatcta 
31201 aaggtgaacc ctaaggctcc cttgcaagtt acaactgata aacagttgga aattgcactg 
31261 gcttatccat ttgaagtcag taatggcaag cttggcataa aagcaggtca tggattgaaa 
31321 gtcattgaca aaattgctgg tttggaaggt ttggcaggta cgcttgtagt tttgactgga 
31381 aaaagaatag gtactgaaaa tcttgaaaac agtgatgggt caagtagagg agttggtata 
31441 ScgLagac ?tgctaaaga tggaggtctg tcttttgata aaaagggtga tttagttgct 
31501 tggaataaac atgatgacag acgcactcta tggacaactc ccgacccatc cccaaattgt 
31561 acaatcgatc aggaaaggga ttcaaagctc actttagtat taacaaaatg tggcagtcaa 
31621 attttggcta atgtctcttt acttgttgta aaaggaaaat ttagtaacat aaaeaataat 
31681 actaatccaa ctgataaaaa aatcacagta aagctacttt ttaatgaaaa gggagtatta 
31741 atggacagtt cgacacttaa gaaagaatat tggaactaca gaaatgataa ttctactgta 
31801 tctcaggcct atgataatgc agttcctttt atgccaaaca taaaagctta tcctaaacct 
31861 accacagaca cttcggctaa accagaagat aaaaaaagtg ctgctaaaag atacattgtg 
31921 agcaatgtct atattggagg cttgccagat aaaactgttg ttataactat taagtttaat 
31981 gcagaaactg aatgtgctta ttcgattacc tttgaattca catgggcaaa aacctttgaa 
32041 aatgtgcagt ttgattcctc ctcttttacc ttttcctata ttgcccaaga aaatgaggac 
32101 gaagacaaat aaaatgtttt aaaatgaatt catgtatctt tattgatttt tacaccagca 
32161 cgggtagtca gtctcccacc accagcccat ttcacagtgt aaacgattct ctcagcacgg 
32221 gtggccttaa atagggaaat gttctgatta gtgcgggaac tggacttggg gtctataatc 
32281 cacacagttt cctggcgagc caaacggggg tcggtgattg agatgaagcc gtcctctgaa 
32341 aagtcatcca agcgggcctc acagtccaag gtcacagtct ggtgaaacga gaagaacgca 
32401 cagattcata ctcggaaaac aggatgggtc tgtgcctctc catcagcgcc ctcaacagtc 
32461 tctgccgccg gggctcggtg cggctgctgc agatgggatc gggatcacaa gtctctctga 
32521 ctatgatccc cacagccttc agcatcagtc tcctggtgcg tcgggcacag caccgcatcc 
32581 tgatctcgct catgttctca cagtaagtgc agcacataat caccatgtta ttcagcagcc 
32641 cataattcag ggtgctccag ccaaaactca tgttggggat gatggaaccc acgtgaccat 
32701 cgtaccagat gcggcagtat atcagatgcc tgcocctcat gaacacactg cccatataca 
32761 tgatctcttt gggcatgtct ctgttcacaa tctgacggta ccagggaaag cgctggttga 
32821 acatgcaccc gtaaatgact ctcctgaacc acacggccag cagggtgcct cccgcccgac 
32881 actgcaggga gcccggggat gaacagtggc aatgcaggat ccagcgctcg tacccgctca 
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32941 ccatctgagc tctcaccaag tccagggtag cggggcacag gcacactgac atacatcttt 
33001 ttaaaatttt tatttcctct ggagtcaaga tcatatccca ggggactgga aactcttgga 
33061 gcagggtaaa gccagcagca catggtaatc cacggacaga acttacatta tgataatctg 
33121 catgatcaca atcaggcaac aggggatgtt gttcagtcag tgaagccctg gtttcctcat 
33181 cagatccrtgg taaacgggcc ctgcgatatg gatgatggcg gagcgagctg gattgaatct 
33241 cggtttgcat tgtagtggat tctcttgcgt accttgtcgt acttctgcca gcagaaatgg 
33301 gcccttgaac agcagatacc cctcctgcgg ccgtcctttc gctgctgccg ctcagtcatc 
33361 caactgaagt acatccattc tcgaagattc tggagaagtt cctctgcatc tgatgaaaca 
33421 aaaaacccgt ccatgcgaat tcccctcatc acatcagcca ggactctgta- ggccatcccc 
33481 atccagttaa tgctgccttg tctatcattc agagggggcg gtggcaggat tggaagaacc 
33541 atttttattc caaacggtct cgaaggacga taaagtgcaa gtcacgcagg tgacagcgtt 
33601 cccctccgct gtgctggtgg aaacagacag ccaggtcaaa acccactcta ttttcaaggt 
33661 gctcgaccgt ggcttcgagc agtggctcta cgcgtacatc cagcataaga atcacattaa 
33721 aggctggccc tccatcgatt tcatcaatca tcaggttaca ttcctgca.cc atccccaggt 
33781 aattctcatt tttccagcct tggattatct ctacaaattg ttggtgtaag tccactccgc 
33841 acatgtggaa aagctcccac agtgccccct ccactttcat aatcaggcag accttcataa 
33901 tagaaacaga tcctgctgct ccaccacctg cagcgtgttc aaaacaacaa gattcaataa 
33961 ggttctgccc tccgccctga gctcgcgcct caatgtcagc tgcaaaaaat cacttaagtc 
34021 ctgggccact acagctgaca attcagagcc agggctaagc gtgggactgg caagcgtaag 
34081 ggaaaacttt aatgctccaa agctagcacc caaaaactgc atgctggaat aagctctctt 
34141 tgtgtctccg gtgatgcctt ccaaaatgtg agtgataaag cgtggtagtt tttctttaat 
34201 catttgcgta atagaaaagt cctgtaaata agtcactagg accccaggga ccacaatgtg 
34261 gtagcttaca ccgcgtcgct gaagcatggt tagtagagat gagagtctga aaaacagaaa 
34321 gcatgcacta aactaaggtg gctattttca ctgaaggaaa aatcactctc tccaacaaca 
34381 gggtacccac tgggtggccc ttgcggacat acaaaaatcg gtccgtgtga ttaaaaagca 
34441 gcacagtaag ttcctgtctt cttccggcaa aaatcacatc ggactgggtt agtatgtccc 
34501 tggcatggta gtcattcaag gccataaatc tgccctgata tccagtagga accagcacac 
34561 tcacttttag gtgaagcaat accaccccat gcggaggaat gtggaaagat tcagggcaaa 
34621 aaaaattata tctattgcta gtcccttcct ggacgggagc aatccctcca ggactatcta 
34681 tgaaagcata cagagattca gccatagctc agcccgctta ccagtagaca gagagcacag 
34741 cagtacaagc gccaacagca gcgactgact acccactgac ccagctccct atttaaaggc 
34801 gccttacact gacgtaatga ccaaaggtct aaaaaccccg ccaaaaaaaa acacacacgc 
34861 cctgggtgtt ttttgcgaaa acacttccgc gttctcactt cctcgtattg atttcgtgac 
34921 ttaacttccg ggttcccacg ttacgtcact tctgccctta catgtaactc agtcgtaggg 
34981 cgccatcttg cccacgtcca aaatggcttc catgtccagc cacgcctccg cggcgaccgt 
35041 tagccgtgcg tcgtgacgtc atttgcatca tcttctctcg tccaatcagc gctggccccg 
35101 ccctaaattc aaaagctcat ttgcatgtta acttttgttt actttgtggg gtatattatt 
35161 gat gate 
SEQ ID NO: 5 
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Ampr on Ad34 ITR cassette 



(bp 31584 to 31 
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1 catcatcaat aatatacctt atagatggaa tggtgccaat atgtaaatga ggtgatttta 
61 aaaattgtgg ggtgtgtggt gattggctgt ggggttaacg gctaaacggg gcggcgcggc 
121 cgtgggaaaa tgacgttttg tgggggtgga gtttttttgc aagttgtcgc gggaaatgtg 
181 acgcataaaa aggctttttt tctcacggaa ctactgactt ttcccacggt atttaacagg 
.241 aaatgaggta gttttgaccg gatgcaagtg aaaattgctg atttgcgcgc gaaaactgaa 
301 tgaggaagtg tttttctgaa taatgtggta tttatggcag ggtggagtat ttgttcaggg 
361 ccaggtagac tttgacccat tacgtggagg tttcgattac cgtgtttttt acctgaattt 
421 ccgcgtaccg tgtcaaagtc ttctgttttt acgtaggtgt cagctgatcg ctacggtatt 
481 tatacctcag ggtttgtgtc aagaggccac tcttgagtgc cagcgagaag agttttctcc 
541 tctgcgccgg cagtttaata ataaaaaaat gagagatttg cgatttctgc ctcaggaaat 
601 aatttctgct gagactggaa atgaaatact ggagcttgtg gtgcacgccc tgatgggaga 
661 cgatccggag ccacctgtgc agctttttga gcctcctacg cttcaggaac tgtatgattt 
721 agaggtagag ggatcggagg attctaatga ggaagctgtg aatggctttt ttaccgattc 
781 tatgctttta gctgctaatg aaggattaga attagatccg cctttggaca ctttcgatac 
841 tccaggggtg attgtggaaa gcggtacagg tgtaagaaaa ttacctgatt tgggttccgt 
901 ggactgtgat ttgcactgct atgaagacgg gtttcctccg agtgatgagg aggaccatga 
961 aaaggagcag tctatgcaga ctgcagcggg tgagggagtg aaggctgcca gtgttggttt 
1021 tcagttggat tgcccggagc ttcctggaca tggctgtaag tcttgtgaat ttcacaggaa 
1081 aaatactgga gtaaaggaac tgttatgttc gctttgttat atgagagcgc actgccactt 
1141 tatttacagt aagtgtgttt aagttaaaat ttaaaggaat atgctgtttt tcacatgtat 
1201 attgagtggg agttttgtgc ttcttattat aggtcctgtg tctgatgctg atgagtcacc 
1261 atctcctgat tctactacct cacctcctga gattcaagca cctgttcctg tggacgtgcg 
1321 caagcccatt cctgtgaagc ttaagcctgg gaaacgtcca gcagtggaaa aacttgagga 
13 81 cttgttacag ggtggggacg gacctttgga cttgagtaca cggaaacggc caagacaata 
1441 agtgttccat atccgtgttt acttaaggtg acgtcaatat ttgtgtgaga gtgcaatgta 
1501 ataaaaatat gttaactgtt cactggtttt tattgctttt tgggcgggga ctcaggtata 
1561 taagtagaag cagacctgta tggttagctc ataggagctg gctttcatcc atggaggttt 
1621 gggccatttt ggaagacctt agaaagacta ggcaactgtt agaggacgct tcggacggag 
1681 tctccggttt ttggagattc tggttcgcta gtgaattagc tagggtagtt tttaggataa 
1741 aacaggacta taaagaagaa tttgaaaagt tgttggtaga ttgcccagga ctttttgaag 
1801 ctcttaattt gggccatcaa gttcacttta aagaaaaagt tttatcagtt ttagactttt 
1861 caaccccagg tagaactgcc gctgctgtgg cttttcttac ttttatatta gataaatgga 
1921 tcccgcagac tcatttcagc aggggatacg ttttggattt cgtagccaca gcattgtgga 
1981 gaacatggaa ggttcgcaag atgaggacaa tcttaggtta ctggccagtg cagcctttgg 
2041 gtgtagcggg aatcctgagg catccaccgg tcatgccagc ggttctggag gaggaacagc 
2101 aagaggacaa cccgagagcc ggcctggacc ctccagtgga ggaggcggag tagctgactt 
2161 gtctcctgaa ctgcaacggg tgcttactgg atctacgtcc actggacggg ataggggcgt 
2221 taagagggag agggcatcta gtggtactga tgctagatct gagttggctt taagtttaat 
2281 gagtcgcaga cgtcctgaaa ccatttggtg gcatgaggtc cagaaagagg gaagggatga 
2341 agtttctgta ttgcaggaga aatattcact ggaacaggtg aaaacatgtt ggttggagcc 
2401 tgaggatgat tgggaggtgg ccattaaaaa ttatgccaag atagctttga ggcctgataa 
2461 acagtataag attactagac ggattaatat ccggaatgct tgttacatat ctggaaatgg 
2521 ggctgag'gtg gtaatagata ctcaagacaa ggcagttatt agatgctgca tgatggatat 
2581 gtggcctgga gtagtcggta tggaagcagt aacttttgta aatgttaagt ttaggggaga 
2641 tggttataat ggaatagtgt ttatggccaa taccaaactt atattgcatg gttgtagctt 
2701 ttttggtttc aacaatacct gtgtagatgc ctggggacag gttagtgtac ggggatgtag 
2761 tttctatgcg tgttggattg ccacagctgg cagaaccaag agtcaattgt ctctgaagaa 
2821 atgcatattc caaagatgta acctgggcat tctgaatgaa ggcgaagcaa gggtccgcca 
2881 ctgcgcttct acagatactg gatgttttat tttaattaag ggcaatgcca gcgtaaagca 
2941 taacatgatt tgcggtgctt ccgatgagag gccttatcaa atgctcactt gtgccggtgg 
3001 gcattgtaat atgctggcta ctgtgcatat tgtttcccat caacgcaaaa aatggcctgt 
3061 ttttgatcac aatgtgttga ccaagtgtac catgcatgca ggtgggcgta gaggaatgtt 
3121 tatgccttac cagtgtaaca tgaatcatgt gaaagtgttg ttggaaccag atgccttttc 
3181 cagaatgagc ctaacaggaa tctttgacat gaacatgcaa atctggaaga tcctgaggta 
3241 tgatgatacg agatcgaggg tgcgcgcatg cgaatgcgga ggcaagcatg ccaggttcca 
3301 gccggtgtgt gtagatgtga ctgaagatct gagaccggat catttggtta ttgcccgcac 
3361 tggagcagag ttcggatcca gtggagaaga aactgactaa ggtgagtatt gggaaaactt 
3421 ggggtggggt tttcagatgg acagattgag taaaaatttg ttttttctgt ctttcagctg 
3481 tcatgagtgg asiacgcttct tttaaggggg gagtcttcag cccttatctg acagggcgtc 
3'541 tcccatcctg ggcaggagtt cgtcagaatg ttatgggatc tactgtggat ggaagacccg * 
3601 tccaacccgc caattcttca acgctgacct atgctacttt aagttcttca cctttggacg 
3661 cagctgcagc cgccgccgcc gcctctgttg ccgctaacac tgtgcttgga atgggttact 
3721 atggaagtat cgtggctaat tccacttcct ctaataaccc ttctaccctg actcaggaca 
3781 agttacttgt ccttttggcc cagctggagg ctttgaccca acgtctgggt gaactttatc 
3841 agcaggtggc cgagttgcga gtacaaactg agtctgctgt cggcacggca aagtctaaat 



FIG. 28A-1 



wo 2004/018627 



PCT/US2003/026145 



47/59 

3901 aaaaaaaaat tccacaatca atgaataaat 
3961 tgtttttatt tcatttttcg cgcacggtat 
4021 acacggtgga ttttttccag aatcctatag 
4081 attaggccat ctttggggtg gagatagctc 
4141 ttgtaaatca cccagtcata acaaggtcgc 
4201 agtaggctga ttgccacaga taagcccttg 
4261 gaggggtgca ttcggggtga aattatgtgc 
4321 ttgccgccaa gatctcgtct tgggttcatg 
4381 gtacatttag gaaatttatc gtgtagcttg 
4441 cccttgtgtc ctccgagatt ttccatgcac 
4501 gcagcagcgc gggcaaacac gttccgtggg 
4561 aaatcatcat aagccatttt aatgaatttg 
4621 gttccttcgg gccccggagc atagttcccc 
4681 tccgatggtg gaatcatgtc cacctggggg 
4741 gtgattagtt gggatgatag-caagtttctg 
4801 ccataaatga ttccgattac aggttgcagg 
4861 tctcgaagca agggggccac ctcgttcatc 
4921 aaatccatta ggaggcgctc tcctcctagt 
4981 ttcagcggtt ttagaccgtc agccatgggc 
5041 agtctgttcc acagttcagt gatgtgttct 
5101 ttcgcgggtt tggacggctc ctggagtagg 
5161 gggttcggtc cttccagggt ctcagtgttc 
5221 ggtgtgcgcc tgcttgggcg cttgccaggg 
5281 acttctgtcg cttggcgccc tgtatgtcgg 
5341 tgagcgcctc ggctgcgtgg cctttggcgc 
5401 ccgggcagta taggcatttc agcgcataca 
5461 agtatgcatc tgcgccgcag gaggcgcaaa 
5521 ccggttcatt ggggtcaaaa acaagttttc 
5581 tggtctccat gagttcgtgt cctcgttgag 
5641 ctgattttac aggcctcttc tccagtggag 
5701 accactctga tacaaaggcg cgcgtccagg 
5761 agcgatcgtt gtcaaccagg gggtccacct 
5821 cttcaacatc caggaatgtg attggcttgt 
5.881 ctgggggggt ataaaagggg gcggttcttt 
5941 ccaggaacgt cagctgttgg ggtaggtatt 
6001 tcaggttgtc agtttctaag aacgaggagg 
6061 ctttcatgag gttttcgtcc atttggtcag 
6121 tggcaaatga tccatacagg gcgttggata 
6181 tcttttcctt gtccgcgcgc tctttggcag 
6241 ggcacttcca ttcggggaag atagttgtca 
6301 ctcgattatg caaggtaatt aaatccacac 
6361 tggtccaaca gagcctacct cctttcctag 
6421 gttcatcggg agggtctgca tccatggtaa 
6481 agctgatggg agtggggtca tctaaggcca 
6541 catatgggtt aaggggactg ccccagggca 
6601 cacagatgtc atagacgtag atgggatcct 
6661 gcccccctct gatacttgct cgcacatagt 
6721 ccggacccaa gttggtgcga ttgggttttt 
6781 cgtgagaatt ggaagagatg gtgggtcttt 
6841 ctacagagtc tctgacaaag tgggcataag 
6901 tgacaagtac gtctagggcg cagtagtcaa 
6961 ggtttttctt ttcccacagt tcgcggttga 
7021 cttctagcgg aaacccgtct ttgtctgcac 
7081 ctgccttgta agggcagcag cccttctcta 
7141 gcagcgaagc gtgagtaagg gcgaaggtgt 
7201 tgaagtccat gtcgtcacag gctccctgtt 
7261 aggcggggtt gggcaaagcg aaagtaacat 
7321 taaaattgcg agtgatgcgg aaaggctgtg 
7381 cagctaggac gatctcgtcg aaaccgttga 
7441 aacgcggcgt gcctttgacg tgaggtagct 
7501 ggtcagataa ggcgtagtgt tcgagagccc 
7561 atgatgacca aagatccacc gccagtgctg 
7621 gctggccaat tgccattttt tctggagtga 
7681 atcgatccca ctttagttta atggctagat 
7741 ctgagagttt catgaccagc atgaaaggaa 



aaacgagctt gttgttgatt taaaatcaag 
gccctagacc accgatctcg atcattgaga 
aggtgggatt gaatgtttag atacatgggc 
cattgaaggg attcatgctc cggggtagtg 
agtgcatggt gttgcacaat atcttttaga 
gtgtaggtgt ttacaaaccg gttgagctgg 
attttggatt ggatttttaa gttggcaata 
ttatgaagga ccaccaagac ggtgtatccg 
gatggaaaag cgtggaaaaa tttggagaca 
tcatccatga taatagcaat ggggccgtgg 
tctgacacat catagttatg ttcctgagtt 
gggcggagag tacccgattg gggtatgaat 
tcacagattt gcatttccca agctttcagt 
gctatgaaga acaccgtttc tggggcgggg 
agcaattgag atttgccaca tccggtgggg 
tggtagttta gggaacggca actgccgtct 
atttccctta catgcatatt ttcccgcacc 
gatagaagtt cttgtagtga ggaaaagttt 
attttggaga gagtttgctg caaaagttct 
atggcatctc gatccagcag acctcctcgt 
gtatgagacg atgggcgtcc agcgctgcca 
gagtcagggt tgtttccgtc acagtgaagg 
tgcgcttcag actcattctg ctggtggaga 
ccaagtagca gtttaccatg agttcgtagt 
ggagcttacc tttggaagtt ttcttgcata 
gcttgggcgc aaggaaaatg gattctgggg 
cagtttcaca ttccaccagc caggttaaat 
cgccatattt tttgatgcgt ttcttacctt 
tgacaaacag gctgtccgta tccccgtaga 
tgcctcggtc ttcttcgtac aggaactctg 
ccagcacaaa ggaggctatg tgggaggggt 
tttccaaagt atgcaaacac atgtcaccct 
aggtgtattt cacgtgacct ggggtccccg 
gctcttcctc actgtcttcc ggatcgctgt 
ccctctcgaa ggcgggcatg acctctgcac 
atttgatatt gacagtgccg gttgagatgc 
aaaacacaat ttttttattg tcaagtttgg 
aaagtttggc eiatggatcgc atggtttggt 
cgatgttgag ttggacatac tcgcgtgcta 
attcatctgg cacgattctc acttgccacc 
tggtggccac ctcgcctcga aggggttcgt 
aacagaaagg gggaagtggg tctagcataa 
agattcccgg aagtaaatcc ttatcaaaat 
tttgccattc tcgagctgcc agtgcacgct 
tgggatgggt gagtgcagag gcatacatgc 
caaagatgcc tatataggtt ggatagcatc 
catatagttc atgtgatggc gctagcaacc 
ctgttctgta gacaatctgg cgaaagatgg 
gaaaaatgtt gaaatgggca tgaggtagac 
attcttgaag cttggttacc agttcggcgg 
gtgtttcttg aatgatgtca taacctggtt 
gaaggtattc ttcgcgatcc ttccagtact 
ggtaagatcc tagcatgtag aactgattaa 
cgggtagaga gtatgcttga gcagcttttc 
ctctgaccat gactttgaga aattggtatt 
cccagagttg gaagtctacc cgtttcttgt 
cgttgaagag aatcttaccg gctctgggca 
gtacttccgc tcgattgttg atcacctggg 
tgttgtgtcc tacgatgtat aattctatga' 
tattgagctc atcaaaggtt aggtctgtag 
attcgtgcag gtgaggattt gcatgtagga 
tttgtaactg gtcccgatac tgacgaaaat 
cacagtagaa ggttctgggg tcttgttgcc 
cgtgggccat gttgacgaga cgctcttctc 
ctagttgttt gccaaaggac cccatccagg 
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7801 tgtaagtttc cacatcgtag gtcaggaaga gtctttctgt gcgaggatga gagccgatcg 
7861 ggaagaactg gatttcctgc caccagttgg aggattggct gttgatgtga tggaagtaga 
7921 agtttctgcg gcgcgccgag cattcgtgtt tgtgcttgta cagacggccg cagtagtcgc 
7981 agcgttgcac gggttgtatc tcgtgaatga gctgtacctg gcttcccttg acgagaaatt 
8D41 tcagtgggaa gccgaggcct ggcgattgta tctcgtgctc ttctatattc gctgtatcgg 
8101 cctgttcatc ttctgtttcg gtggtggtca tgctgacgag cccccgcggg aggcaagtcc 
8161 agacctcggc gcgggagggg cggagctgaa ggaccagagc gcgcaggctg gagctgtcca 
8221 gagtcctgag acgctgcgga ctcaggttag taggtaggga cagaagatta acttgcatga 
8281 tcttttccag ggcgtgcggg aggttcagat . ggtacttgat. ttccacaggt tcgtttgtag. 
8341 agatgtcaat ggcttgcagg gttccgtgtc ctttgggcgc cactaccgta cctttgtttt 
8401 ttcttttgat cggtggtggc tctcttgctt cttgcatgct cagaagcgat gacggggacg 
8461 cgcgccgggc ggaagcggtt gttccggacc cggaggcatg gctggtagtg gcacgtpggc 
8521 gccgcgcacg ggcaggttct ggtactgcgc tctgagaaga cttgcgtgcg ccaccacgcg 
8581 tcgattgacg tcttgtatct gacgtctctg ggtgaaagct accggccccg tgagcttgaa 
8641 cctgaaagag agttcaacag aatcaatttc ggtatcgtta acggcagctt gtctcagtat 
8701 ttcttgtacg tcaccagagt tgtcctggta ggcgatctcc gccatgaact gctcgatttc 
8751 ttcctcctga agatctccgc gacccgctct ctcgacggtg gccgcgaggt cattggagat 
8821 acggcccatg agttgggaga atgcagtcat gcccgcctcg ttccagacgc ggctgtaaac 
8881 cacggccccc tcggagtctc ttgcgcgcat caccacctga gcgaggttaa gctccacgtg 
8941 tctggtgaag accgcatagt tgcataggcg ctgaaaaagg tagttgagtg tggtggcaat 
9001 gtgttcggcg acgaagaaat acatgatcca tcgtctcagc ggcatttcgc tgacatcgcc 
9061 cagagcttcc aagc'gctcca tggcctcgta gaagtccacg gcaaaattaa aaaactggga 
9121 ertttcgcgcg gacacggtca at tec tec tc gagaagaegg atgagttcgg ctatggtggc 
9181 ccgtacttcg cgttcgaagg ctcccgggat ctcttcttcc tettctatct cttcttccac 
9241 taacatctct tcttcgtctt caggcggggg cggagggggc acacggcgac gtcgacggcg 
9301 cacgggcaaa cggtcgatga atcgttcaat gacctctccg cggcggcggc gcatggtttc 
9361 agtgacggcg cggccgttct cgcgcggtcg cagagtaaaa acaccgccgc gcatctpctt 
9421 aaagtggtga ctgggaggtt ctccgtttgg gagggagagg gcgctgatta tacattttat 
9481 taattggccc gtagggactg cgcgcagaga tctgatcgtg tcaagatcca cgggatctga 
9541 aaacctttcg acgaaagcgt ctaaccagtc acagtcacaa ggtaggctga gtacggcttc 
9601 ttgtgggcgg gggtggttat gtgttcggtc. tgggtcttct gtttcttctt catctcggga 
9661 aggtgagacg atgctgctgg tgatgaaatt aaagtaggca gttctaagac ggcggatggt 
9721 ggcgaggagc accaggtctt tgggtccggc ttgctggata cgcaggcgat tggccattcc 
9781 ccaagcatta tcctgacatc tagcaagatc tttgtagtag tcttgcatga gccgttctac 
9841 gggcacttct tcctcacccg ttctgccatg catacgtgtg ag1:ccaaacc cgcgcattgg 
9901 ttgtaecagt gccaagtcag ctacgactct ttcggcgagg atggcttgct gtacttgggt 
. 9'961 gagggtggct tgaaagtcat caaaatccac aaagcggtgg taagccccgg tattaatggt 
10021 gtaagcacag ttggccatga ctgaccagtt aactgtctgg tgaccagggc gcacgagctc 
10081 ggtgtattta aggcgcgaat aggcgcgggt gtcaaagatg taatcgttgc aggtgccfcac 
10141 cagatactgg taacctataa gaaaatgcgg cggtggttgg cggtagagag gccatcgttc 
10201 tgtagctgga gcgccggggg cgaggtcttc caacataagg cggtgatagc cgtagatgta 
10261 cctggacatc caggtgattc ctgcggcggt agtagaagcc cgaggaaact cgcgtacgcg 
10321 gttccaaatg ttgcgtagcg gcatgaagta gttcattgta ggcacggttt gaccagtgag 
10381 gcgcgcgcag tcattgatgc tctatagaca cggagaaaat gaaagcgttc agcgactcga 
10441 ctccgtagcc tggaggaacg tgaacgggtt gggtcgcggt gtaccccggt tcgagacttg 
10501 tactcgagcc ggccggagcc gcggctaacg tggtattggc actcccgtct cgacccagcc 
10561 tacaaaaatc caggatacgg aatcgagtcg ttttgctggt tgccgaatgg cagggaagtg 
10621 agtcctattt tttttttttg ccgctcagat gcatcccgtg ctgcgacaga tgcgtcccca 
10681 acaacagccc ccctcgcagc agcagcaacc acaaaaggct gtccctgcaa ctactgcaac 
10741 tgccgctgtg agcggtgcgg gacagcccgc ctatgatctg gacttggaag agggcgaagg 
10801 actggcacgt ctaggtgcgc cttcgcccga gcggcatccg cgagttcaac tgaaaaaaga 
10861 ttctcgcgag gcgtatgtgc cccaacagaa cctatttaga gacagaagcg gcgaggagcc 
10921 ggaggagatg cgagcttccc gctttaacgc gggtcgtgag ctgcgtcacg gtttggacag 
10981 aagacgagtg ttgcgggacg aggatttcga agttgatgaa gtgacaggga tcagtcctgc 
11041 cagggcacac gtggctgcag ccaaccttgt atcggcttac gaacagacag taaaggaaga 
11101 gcgtaatttc caaaagtctt ttaataatca tgtgcgaacc ctcattgccc gcgaagaagt 
11161 cacccttggt ttgatgcatt tgtgggattt gatggaagct atcattcaga accctactag 
11221 caaacctctg accgcacagc tgtttctggt ggtgcaacac agcagagaca atgaggcttt 
11281 cagagaggcg ctgctcaaca . tcaccgraacc cgaggggaga tggttgtatg atcttatcaa 
11341 cattctacag agtatcatag tgcaggagcg gagcctgggc ctggccgaga aggtggctgc 
11401 catcaattac tcggttttga gcttgggaaa gtattacgct cgcaagatct acaagactcc 
11461 atacgttccc atagacaagg aggtgaagat agatgggttc tacatgcgca tgacgctgaa 
11521 ggtgttgacc ctgagcgatg atcttggggt gtaccgcaat gacagaatgc atcgcgcggt 
11581 gagcgccagc aggaggcgcg agttaagcga cagggaactg atgcacagtt tgcaaagagc 
11641 tctaactgga gctggaaccg agggtgagaa ttactttgat atgggagctg acttgcagtg 
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11701 gcagcctagt cgcagggctc tgaacgccgc gacggcagga tgtgagcttc cttacataga 
11761 agaggcggat gaaggcgagg aggaagaggg cgagtacttg gaagactgat ggcacaaccc 
11821 gtgttttttg ctagatggaa cagcaagcac cggatcccgc aatgcgggcg gcgctgcaga 
11881 gccagccgtc cggcattaac tcctcggacg attggaccca ggccatgcaa cgtatcatgg 
11941 cgttgacgac tcgcaacccc gaagccttta gacagcaacc ccaggccaac cgtctatcgg 
12001 ccatcatgga agctgtagtg ccttcccgct ctaatcccac tcatgagaag gtcctggcca 
12061 tcgtgaacgc gttggtggag aacaaagcta ttcgtccaga tgaggccgga ctggtataca 
12121 acgctctctt agaacgcgtg gctcgctaca acagtagcaa tgtgcaaacc aatttggacc 
12181 gtatgataac agatgtacgc gaagccgtgt ctcagcgcga aaggttccag cgcgatgcca 
12241 acctgggttc gctggtggcg ttaaatgctt tcttgagtac tcagcctgct aatgtgccgc 
12301 gtggtcaaca ggattatact aactttttaa gtgctttgag actgatggta tcagaagtac 
12361 ctcagagcga agtatatcag tccggtcctg attacttctt tcagactagc agacagggct 
12421 tgcagacggt aaatctgagc caagctttta aaaaccttaa aggtttgtgg ggagtgcatg 
12481 ccccggtagg agaaagagca accgtgtcta gcttgttaac tccgaactcc cgcctattat 
12541 tactgttggt agctcctttc accgacagcg gtagcatcga ccgtaattcc tatttgggtt 
12601 acctactaaa cctgtatcgc gaagccatag ggcaaagtca ggtggacgag cagacctatc 
12661 aagaaattac ccaagtcagt cgcgctttgg gacaggaaga cactggcagt ttggaagcca 
12721 ctctgaactt cttgcttacc aatcggtctc aaaagatccc tcctcaatat gctcttactg 
12781 cggaggagga gaggatcctt agatatgtgc agcagagcgt gggattgttt ctgatgcaag 
12841 agggggcaac tccgactgca gcactggaca tgacagcgcg aaatatggag cccagcatgt 
12901 atgccagtaa ccgacctttc attaacaaac tgctggacta cttgcacaga gctgccgcta 
12961 tgaactctga ttatttcacc aatgccatct taaacccgca ctggctgccc ccacctggtt 
13021 tctacacggg cgaatatgac atgcccgacc ctaatgacgg atttctgtgg gacgacgtgg 
13*081 acagcgatgt tttttcacct ctttctgatc atcgcacgtg gaaaaaggaa ggcggcgata 
13141 gaatgcattc ttctgcatcg ctgtccgggg tcattggtgc taccgcggct gagcccgagt 
13201 ctgcaagtcc ttttcctagt ctaccctttt ctctacacag tgtacgtagc agcgaagtgg 
13261 gtagaataag tcgcccgagt ttaatgggcg aagaggagta cctaaacgat tccttgctca 
13321 gaccggcaag agaaaaaaat ttcccaaaca atggaataga aagtttggtg gataaaatga 
13381 gtagatggaa gacttatgct caggatcaca gagacgagcc tgggatcatg gggactacaa 
13441 gtagagcgag ccgtagacgc cagcgccatg acagacagag gggtcttgtg tgggacgatg 
13501 aggattcggc cgatgatagc agcgtattgg acttgggtgg gagaggaagg ggcaacccgt 
13561 ttgctcattt gcgccctcgc ttgggtggta tgttgtaaaa aaaaataaaa aagaaaaaac 
13621 tcaccaaggc catggcgacg agcgtacgtt cgttcttctt tattatctgt gtctagtata 
13681 atgaggcgag tcgtgctagg cggagcggtg gtgtatccgg agggtcctcc tccttcgtac 
13741 gagagcgtga tgcagcagca gcaggcgacg gcggtgatgc aatccccact ggaggctccc 
13801 tttgtgcctc cgcgatacct ggcacctacg gagggcagaa acagcattcg ttactcggaa 
13861 ctggcacctc agtacgatac caccaggttg tatctggtgg acaacaagtc ggcggacatt 
13921 gcttctctga actatcagaa tgaccacagc aacttcttga ccacggtggt gcaaaacaat 
13981 gactttaccc ctacggaagc cagcacccag accattaact ttgatgaacg atcgcggtgg 
14041 ggcggtcagc taaaaaccat catgcatact aacatgccca acgtgaacga gtatatgttt 
14101 agtaacaagt tcaaagcgcg tgtgatggtg tccagaaaac ctcctgaggg tgttagagta 
14161 gacgataatt atgatcataa gcaagatatt ctaaaatacg agtggttcga gtttactttg 
14221 ccagaaggca acttttcggt cactatgact atcgacttga tgaacaatgc catcatagac 
14281 aattacttga aagtgggcag acagaatgga gtgttggaaa gtgacattgg tgttaagttc 
14341 gacactagga acttcaagtt gggatgggat ccagaaacta agttgatcat gcctggggtt 
14401 tacacctatg aggccttcca tcctgacatc gtattgctgc ctggctgcgg agtggacttt 
14461 accgaaagcc gtctgagcaa ccttcttggc attagaaaga aacacccatt ccaagagggt 
14521 tttaagatct tgtatgagga tttagaagga ggaaatattc cagccctttt ggatgtagat 
14581 gcttatgaga acagcaagaa agatcaaiaaa gccaaaatag aagctgctgc agaagctaaa 
14641 gcaaacatag ttgccaacga tccggtaagg gtggctaacg ctagtgaaat caggggagac 
14701 agttttgccg caacatccgt tccgactaaa gaatcattat tggatgatgt gtctcaaaac 
14761 atagagttaa aactcactat taagcctgtg gaaaaagatg gcaaaaacag aagttacaat 
14821 gtgttggaag ataaaatcaa cacggcctat cgcagttggt acctttcgta caattatggc 
14881 gaccccgaaa aaggagtgcg ttcctggaca ttgctcacca cctcagatgt cacctgcgga 
14941 gcggagcagg tctactggtc gcttccagac atgatgcagg atcctgtcac tttccgctcc 
15001 actagacaag tcagtaacta ccctgtggtg ggtgcagagc ttatgcccgt cttttcaaag 
15061 agcttctaca acgaacaagc tgtgtactcc cagcaigctcc gccagtccac ctcgcttacg 
15121 cacgtcttca accgctttcc tgagaaccag attttaatcc gtccgccggc gcccacaatt 
15181 accaccgtca gtgaaaacgt tcctgctctc acagatcacg ggaccctgcc gttgcgcagc 
15241 agtatccggg gagtccaacg tgtgaccgtt actgacgcca gacgccgcac ctgtccctac 
15301 gtgtacaagg cactgggcat agtcgcaccg cgcgtccttt caagccgcac tttctaaaaa 
15361 aaaaaaaaaa atgtccgttc ttatctcgcc cagtciataac accggttggg gtctgcgcgc 
15421 tcccagcaag atgtacggag gcgcacgcaa acgttctacc caacatcccg tgcgtgttcg 
15481 cgggcatttt cgcgctccat ggggtgccct caagggccgc actcgcgttc gaaccaccgt 
15541 cgatgatgta atcgatcagg tggttgccga cgcccgtaat tatactccta ctgcgcctac 
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15601 atctactgtg gacgcagtta 'ttgacagtgt agtggctgac gctcgcaact atgctcgacg 
15661 taagagccgg cgaaggcgca ttgccagacg tcaccgagct accactgcca tgcgagcagc 
15721 aagagctctg ctacgaagag ctagacgcgt ggggcgaaga gccatgctta gggcggccag 
15781 acgtgcagct tcgggcgcca gcgccggcag gtcccgcagg caagcagccg ctgtcgcagc 
15841 ggcgactatt gccgacatgg cccaatcgcg aagaggcaat gtatactggg tgcgtgacgc 
15901 tgccaccggt caacgtgtac ccgtgcgcac ccgtccccct cgcacttaga agatactgag 
15961 cagtctccga tgttgtgtcc cagcggcgag gatgtccaag cgcaaataca aggaagaaat 
16021 gctgcaggtt atcgcacctg aagtctacgg ccaaccgttg aaggatgaaa aaaaaccccg 
16081 ^caaaatcaag cgggtaaaaa aggacaaaaa jagaagaggaa gatggcgatg atgggctggc 
16141 ggagtttgtg cgcgagtttg ccccacggcg acgcgtgcaa tggcgtgggc gcaaagttcg 
16201 acatgtgttg agacctggaa cttcggtggt ctttacaccc ggcgagcgtt caagcgctac 
16261 ttttaagcgt tcctatgatg aggtgtacgg ggatgatgat attcttgagc aggcagctga 
16321 ccgattaggc gagtttgctt atggcaagcg tagtagaata aatcccaagg atgaaacagt 
16381 gtccataccc ttggatcatg gaaatcccac ccctagtctt aaaccggtca ctttgcagca 
16441 agtgttaccc gtaactccgc gaacaggtgt taaacgcgaa ggtgaagatt tgtatcccac 
16501 tatgcaactg atggtgccca aacgccagaa gttggaggac gttttggaga aagtaaaagt 
16561 ggatccagat attcaacctg aggttaaagt gagacccatt aagcaggtag cgcctggtct 
16621 gggagtacaa actgtagaca ttaaaattcc cactgaaagt atggaagtgc aaactgaacc 
16681 cgcaaagcct actgccacct ccactgaagt gcaaacggac ccatggatgc ccatgcctat 
16741 tacaactgac gccgtcggtc ccactcgaag atcccgacga aagtacggtc cagcaagtct 
16801 gttgatgccc aactatgtcg tacacccatc tattattcct actcctggtt accgaggcac 
16861 tcgctactat cgcagccgaa acagtacttc ccgccgtcgc cgcaagacac ctgcaaatcg 
16921 cagtcgtcgc cgtagacgca caagcaaacc gattcccggc gccctggtgc ggcaagtgta 
16981 ccgcaatggt agtgcggaac ctttgacact gccgcgtgcg cgttaccatc ctagtatcat 
17041 cacttaatca atgttgccgc tgcctccttg cagatatggc cctcacttgt cgccttcgcg 
17,101 ttcccatcac tggttaccga ggaagaaact cgcgccgtag aagagggatg ttggggcgcg 
17161 gaatgcgacg ctacaggcga cggcgtgcta tccgcaagca attgcggggt ggttttttgc 
17221 cagccttaat tccaattatc gctgctgcga ttggcgcaat accaggcata gcttccgtgg 
17281 cggttcaggc ctcgcaacga cattgacatt ggaaaaaaaa aaaacgtata aataaaaaat 
17341 acaatggact ctgacactcc tggtactgtg actatgtttt cttagagatg gaagacatca 
17401 atttttcatc cttggctccg cgacacgigca cgaagccgta catgggcacc tggagcgaca 
17461 tcggcacgag ccaactgaac gggggcgcct tcaattggag cagtatctgg agcgggctta 
17521 aeiaattttgg ctcaaccata aaeiacatacg ggaacaaagc ttggaacagc agtacaggac 
17581 aggcgcttag aaataaactt aaagaccaga acttccaaca aaaagtagtc gatgggatag 
17641 cttccggtat caatggagtg gtagatttgg ctaaccaggc tgtgcagaaa aagataaaca 
17701 gtcgtttgga cccgccgcca gcaaccccag gtgaaatgca agtggaggaa gaaattcctc 
17761 cgccagaaaa acgaggcgac aagcgtccgc gtcccgattt ggaagagacg ctggtgacgc 
17821 gcgtagatga accgccttct tatgaggaag caacgaagct tggaatgccc accactagac 
17881 cgatagcccc tatggccacc ggggtgatga aaccttctca gttgcatcga cccgtcacct 
17941 tggatttgcc ccctcctcct gctgctactg ctgtacccgc ttctaagcct gtcgctgccc 
18001 cgaaaccagt cgccgtagcc aggtcacgtc ccgggggcgc tcctcgtcca aatgcacact 
18061 ggcaaaatac tctgaacagc atcgtgggtc taggcgtgca aagtgtaaaa cgccgtcgct 
18121 gcttttaatt aaatatggag tagcgcttaa cttgcctatc tgtgtatatg tgtcattaca 
18181 cgccgtcaca gcatcagagg aaaaaaggaa gaggtcgtgc gtcgacgctg agttactttc 
18241 aagatggcca ccccatcgat gctgccccaa tgggcataca tgcacatcgc cggacaggat 
18301 gcttcggagt acctgagtcc gggtctggtg cagttcgccc gcgccacaga cacctacttc 
18361 aatctgggaa ataagtttag aaatcctacc gtagcgccga cccacgatgt gaccaccgat 
18421 cgtagccagc ggctcatgtt gcgcttcgtg cccgttgacc gggaggacaa tacatactct 
18481 tacaaagtgc ggtacaccct ggccgtgggc gacaacagag tgctggatat ggccagcacg 
18541 ttctttgaca ttaggggcgt gttggacaga ggtcccagtt ttaaacccta ttctggtacg 
18601 gcttacaact ccctggctcc taaaggcgct ccaaatgcat ctcagtggtt ggataaggga 
18661 gttacaagca ctggcctagt ggacgacggc aatactgatg atggggaaga agccaaaaaa 
18721 gcaacataca cttttggtaa tgctccagta aaagccgagg ctgaaatcac aaaagacgga 
18781 ttgccggtgg gcttggaagt ttcaactgaa ggtcctaaac caatctatgc tgataagctt 
18841 tatcagccag aacctcaagt gggagacgaa acttggactg acctagacgg aaaaaccgaa 
18901 gagtatggag ggagggttct taaacctgaa actaaaatga aaccctgcta cggatctttt 
18961 gctaaaccta ctaatattaa aggaggtcag gcaaaggtaa aaccaaaaga agacgatggc 
19021 actaacaaca tcgagtatga cattgacatg aacttctttg acttaagatc acaaagatca 
19081 gaactcaaac ctaaaattgt aatgtatgca gaaaatgtgg acctggaatg tccagatact 
19141 catgttgtgt acaaacctgg agtttcagat gctagttctg agaccaatct tggacaacag 
19201 tctatgccca acagacccaa ctacattggc ttcagagata acttcatcgg acttatgtac 
19261 tataacagta ctggcaacat gggggtactg gctggccaag cgtctcagtt gaatgcagtg 
19321 gttgacttgc aggacagaaa cacagaactg tcttaccaac tcttgcttga ctctctgggc 
. 19381 gacagaacca gatactttag catgtggaat caggctgtgg acagttatga tcctgatgta 
19441 cgtgttattg aaaatcatgg tgtggaagat gaacttccca actattgttt tccgttggat 
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19501 ggtgtcggtc cgcgaacaga tagttacaag gagattaagc caaatggaga ccaatctact 
19561 tggacaaatg tagacccaac tggcagcagt gaacttgcta agggaaatcc atttgccatg 
.19621 gaaattaacc ttcaagccaa tctatggcga agtttccttt attccaatgt ggctctatat 
19681 ctcccagact cgtacaaata caccccgtcc aatgtcactc ttccagaaaa caaaaacacc 
19741 tacgactaca tgaacgggcg ggtggtgccg ccatctctag tagacaccta tgtgaacatt 
19801 ggtgccaggt ggtctctgga tgccatggac aatgtcaacc cattcaacca ccaccgtaac 
19861 gctggcttgc gttaccgatc catgcttctg ggtaacggac gttatgtgcc tttccacata 
19921 caagtgcctc aaaaattctt cgctgttaaa aacctgctgc ttctcccagg ctcctacact 
19981 tatgagtgga actttaggaa ggatgtaaac atggttctac agagttccct cggtaacgac 
20041 ctacgggtag atggcgccag catcagtttt acgagcatca acctctatgc tacttttttc 
20101 cccatggctc acaacaccgc ttccaccctt gaagccatgc tgcggaatga caccaatgat 
20161 cagtcattca acgactacct atctgcagct aacatgctct accccattcc tgccaatgca 
20221 acceiatattc ccatttccat tccttctcgc aactgggcgg ctttcagagg ctggtcattt 
20281 accagactga aaaccaaaga aactccctct ttggggtctg gatttgaccc ctacttcgtc 
20341 tattctggtt ctattcccta cctggatggt accttctacc tgaaccacac ttttaagaag 
20401 gtttccatca tgtttgactc ttcagtgagc tggcctggaa atgacaggtt actatctcct 
20461 aacgaatttg aaataaagcg cactgtggat ggcgaaggct acaacgtagc ccaatgcaac 
20521 atgaccaaag actggttctt ggtacagatg ctcgccaact acaacatcgg ctatcagggc 
20581 ttctacattc cagaaggata caaagatcgc atgtattcat ttttcagaaa cttccagccc 
20641 atgagcaggc aggtggttga tgaggtcaat tacaaagact tcaaggccgt cgccataccc 
20701 taccaacaca acaactctgg ctttgtgggt tacatggctc cgaccatgcg tcaaggtcaa 
20761 ccctatcccg ctaactatcc ctatccactc attggaacaa ctgccgtaaa tagtgttacg 
20821 cagaaaaagt tcttgtgtga cagaaccatg tggcgcatac cgttctcaag caacttcatg 
20881 tctatgggag cccttacaga cttgggacag aacatgctct atgccaactc agctcatgct 
20941 ctggacatga cctttgaggt ggatcccatg gatgagccca ccctgcttta tcttctcttc 
21001 gaagttttcg acgtggtcag agtgcatcag ccacaccgcg gcatcatcga ggcagtctac 
21061 ctgcgtacac cgttctcggc cggtaacgct accacgtaag aagcttcttg cttcttgcaa 
21121 acagcagctg caaccatggc ctgcggatcc caaaacggct ccagcgagca agagctcaga 
21181 gccattgtcc aagacctggg ttgcggacca tattttttgg gaacctttga taagcgcttc 
21241 ccggggttca tggcccccga taagctcgcc tgtgccattg taaatacggc cggacgtgag 
21301 acggggggag agcactggtt ggctttcggt tggaacccac gttctaacac ctgctacctt 
21361 tttgatcctt ttggattctc ggatgatcgt ctcaaacaga tttaccagtt tgaatatgag 
21421 ggtctcctgc gccgcagcgc tcttgctacc aaggaccggt gtattacgct ggaaaaatct 
21481 acccagaccg tgcagggccc ccgttctgcc gcctgcggac ttttctgctg catgttcctt 
21541 catgcctttg tgcactggcc tgaccgtccc atggacggaa accccaccat gaaattgcta 
21.601 actggagtgc caaacaacat gcttcattct cctaaagtcc agcccaccct gtgtgacaat 
21661 caaaaagcac tctaccattt tctcaatacc cattcgcctt attttcgctc tcatcgtaca 
21721 cacatcgaaa gggccactgc gttcgaccgt atggatgtgc aataatgatt catgtaaaca 
21781 acgtgttcaa taaacagcac tttatttttt acatgtatcg aggctctgga ttacttattt 
21841 atttacaagt cgaatgggtt ctgacgagaa tcagaatgac ccgcaggcag tgatacgttg 
21901 cggaactgat acttgggttg ccacttgaat tcgggaatca ccaacttggg aaccggtata 
21961 tcgggcagga tgtcactcca cagctttctg gtcagctgca aagctcccag caggtcagga 
22021 gccgaaatct tgaaatcaca attaggacca gtgctctgag cgcgagagtt gcggtapacc 
22081 ggattgcagc actgaaacac catcagcgac ggatgtctta cgcttgccag cacggtggga 
22141 tctgcaatca tgcccacatc cagatcttca gcattggcaa tgctgaacgg ggtcatcttg 
22201 caggtctgcc tacccatggc gggcacccaa ttaggcttgt ggttacaatc gcagtgcagg 
22261 gggatcagta tcatcttggc ctgatcctgt ctgattcctg gatacacggc tctcatgaaa 
22321 gcatcatatt gcttgaaagc ctgctgggct ttactaccct cggtataaaa catcccgcag 
22381 gacctgctcg aaaactggtt agctgcgcag ccggcatcat tcacacagca gcgggcgtca 
22441 ttgttggcta tttgcaccac acttctgccc cagcggtttt gggtgatttt ggttcgctcg 
22501 ggattctcct tcaaggctcg ttgtccgttc tcgctggcca catccatctc gataatctgc 
22561 tccttctgaa tcataatatt gccatgcaag cacttcagct tgccctcata atcattgcag 
22621 ccatgaggcc acaacgcaca gcctgtacat tcccaattat ggtgggcgat ctgagaaaaa 
22681 gaatgtatca ttccctgcag aaatcttccc atcatcgtgc tcagtgtctt gtgactagtg 
22741 aaagttaact ggatgcctcg gtgctcctcg ttcacgtact ggtgacagat gcgcttgtat 
22801 tgttcgtgct gctcaggcat tagtttaaaa gaggttctaa gttcgttatc cagcctgtac 
22861 ttctccatca gcagacacat cacttccatg cctttctccc aagcagacac caggggcaag 
22921 ctaatcggat tcttaacagt gcaggcagca gctcctttag ccagagggtc atctttggcg 
22981 atcttctcaa tgcttctttt gccatccttc tcaacgatgc gcacgggcgg gtagctgaaa 
23041 cccactgcta caagttgcgc ctcttctctt tcttcttcgc tgtcttgact gatgtcttgc 
23101 atggggacat gtttggtctt ccttggcttc tttttcgggg gtatcggagg aggaggactg 
23161 tcgctccgtt ccggagacag ggaggattgt gacgtttcgc tcaccattac caactgactg 
23221 tcggtagaag aacctgaccc cacacggcga caggtgtttc tcttcggggg cagaggtgga 
23281 ggcgattgcg aagggctgcg gtccgacctg gaaggcggat gactggcaga accccttccg 
23341 cgttcggggg tgtgctccct gtggcggtcg cttaactgat ttccttcgcg gctggccatt 
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23401 gtgttctcct aggcagagaa acaacagaca 
23461 cacgagtgcc atcacatctc gtcctcagcg 
23521 caccgcccag tcctgccacc acctctaccc 
23581 acatgcagaa taaaaaagpg aaagagtctg 
23641 tgacaccggt ggaacacgag gaagagttga 
23701 gcccaaaaca gcaagcggat aactatcacc 
23761 actacctcat agggcttgac ggggaagacg 
23821 tcatagtcaa ggatgcatta ttggacagaa 
23881 gccgcgccta cgagcttaac .ctattttcac 
23941 gcacctgcga gccaaatcct cgcttaaact 
24001 tggctaccta tcacatcttt tttaaaaatc 
24061 gcacccgcgc cgatgcccta ctcaatctgg 
24121 ccttggaaga ggttccaaag atcttcgagg 
24181 atgctctgca aaagggagaa aatggcatgg 
24241 aaggcgataa tgccagactc gcagtactca 
.24301 accccgctgt caacctgccc cctaaagtca 
24361 agcgcgcaag tcccctttca gaagacatgc 
24421 cagtggtcag tgatgagcag ctaacccgat 
24481 aagagcgtcg caagcttatg atggccgtgg 
24541 ggcgtttctt taccgattca gaaaccttgc 
24601 ttagacacgg ctttgtgcgg caggcatgca 
24661 tttcctacat gggtattctg catgagaatc 
24721 ttaaggggga agcccgccgt gattacatcc 
24781 cgtggcaaac cggcatgggt gtatggcagc 
24841 taaacaagct cttacagaaa tctcttaagg 
24901 tcgcttccga cctggcagac ctcatcttcc 
24961 gactgcctga ctttatgagc cagagcatgc 
25021 gctccggtat cctgcccgcc acctgctgcg 
25081 accgcgaatg ccccccgccg ctatggagtc 
25141 tctcctacca ctcggatgtg atcgaggatg 
25201 gccgctgcaa tctgtgcacg ccccaccggt 
25261 aaacccagat aatagigcacc tttgaattgc 
25321 ctcctgggca eiagtttaaaa ctgaccccgg 
25381 ttgccccgga agattaccac ccctatgaaa 
25441 cgaaagccga actttcggcc tgcgtcatca 
25501 ccatccaaaa atcccgccaa gaatttctac 
25561 cccagaccgg cgaggaactc aacacaaggt 
25621 aagaagttga aggtgcagcc gccgccccca 
25681 gcagaggaag cggaggagga ggacagtctg 
25741 gaggaaaacg aggaggcaga ggaggtggaa 
25801 gctgcggaga caagcaacag cgctaccatc 
25861 cccagcagta gatgggacga gaccggacgc 
25921 ggtaagaagg atcggcaggg atacaagtcc 
25981 tgcttgcatg agtgcggggg caacatatcc 
26041 ggggtgaact ttccgcgcaa tgttttgcat 
26101 agccagcaaa tcccggcagt ctcgacagat 
2 6161 accagcagcg gcagttagaa aatacacaac 
26221 agccaacgag ccagcgcaaa cccgagagtt 
26281 catcttccag cagagtcggg gccaagagca 
26341 ttcgctcacc agaagttgtt tgtatcacaa 
26401 ggacgccgag gctctcttca acaagtactg 
26461 cgcgcttatt caaaaaaggc gggaattaca 
26521 cgccttacat gtggagttat cagccccaaa 
26581 actccacccg catgaattgg ctcagcgccg 
26641 tacgcgccta ccgaaaccaa atacttttgg 
26701 aacaccttaa tcccagaaat tggcccgccg 
26761 ccactgtatt acttcctcga gacgcccagg 
26821 agttagcggg cggctccacc ctatgtcgtc 
26881 tgatcagagg ccgaggtatc cagctcaacg 
26941 gaccagacgg aatctttcag attgccggct 
27001 ctgttctgac tttggaaagt tcgtcttcgc 
27061 aatttgtgga ggagtttact ccctctgtct 
27121 actacccgga cgagttcata ccgaacttcg 
27181 attgatgtct ggtgacgcgg ctgagctatc 
27241 ctttcgctgc tttgcccggg aactcattga 



tggaaactca gccattgctg tcaacatcgc 
acgaggaaaa ggagcagagc ttaagcattc 
tagaagataa ggaggtcgac gcatctcatg 
agccagacat cgaacaagac ccgggctatg 
aacgctttct agagagagag gatgaaaact 
aagatgctgg aaatagggat cagaacaccg 
cgctccttaa acatctagca agacagtcac 
ctgaagtgcc catcagtgtc gaagagctca 
ctcgtactcc jccccaaacgt cagccaaacg 
tttatccagc ttttgctgtg ccagaagtac 
aaaaaattcc agtctcctgc cgcgctaatc 
gacctggttc acgcttacct gatatagctt 
gtctgggcaa taatgagact cgggccgcaa 
atgagcatca cagcgttctg gtggaattgg 
agcgaagcgt cgaggtcaca cactttgcat 
tgacggccgt catggaccag ttactcatta 
atgacccaga tgcctgtgat gagggtaaac 
ggctgggcac cgactctccc cgggatttgg 
tgctggttac cgtagaacta gagtgtcttc 
gcaaactcga agagaatctg cactacactt 
agatatctaa cgtggaactc accaacctgg 
gcctaggaca aagcgtgctg cacagcaccc 
gcgattgtgt ttatctctac ctgtgccaca 
aatgtttaga agaacagaac ctgaaagagc 
ttctgtggac agggttcgac gagcgcaccg 
cagagcgtct cagggttact ttgcgaaacg 
ttaacaattt tcgctctttc atcctggaac 
cactgccctc cgactttgtg cctctcacct 
actgctacct gttccgtctg gccaactacc 
tgagcggaga cggcttgctg gagtgtcact 
ccctagcttg caacccccag ttgatgagcg 
aaggccccag cagccaaggc gatgggtctt 
gactgtggac ctccgcctac ttgcgcaagt 
tcaagttcta tgaggaccaa tcacagcctc 
cccagggggc aattctggcc caattgcaag 
tgaaaaaggg taagggggtc taccttgacc 
tccctcagga tgtcccaacg acgagaaagc 
gaagatatgg aggaagattg ggacagtcag 
gaggacagtc tggaggaaga cagtttggag 
gaagtaaccg ccgacaaaca gttatcctcg 
tccgctccga gtcgaggaac ccggcggcgt 
ttcccgaacc caaccagcgc ttccaagacc 
tggcgggggc ataagaatgc catcatctcc 
ttcacgcggc gctacttgct attccaccat 
tactaccgtc acctccacag cccctactat 
aaagacagcg gcggcgacct ccaacagaaa 
aagtgcagca acaggaggat taaagattac 
aagaaatcgg atctttccaa ccctgtatgc 
ggaactgaaa ataaaaaacc gatctctgcg 
gagcgaagat caacttcagc gcactctcga 
cgcgctgact cttaaagagt aggcagcgac 
tcatcctcga catgagtaaa gaaattccca 
tgggattggc ggcaggcgcc tcccaggact 
ggccttctat gatttctcga gttaatgata 
aacagtcagc tcttaccacc acgccccgcc 
ccctagtgta ccaggaaagt cccgctccca 
ccgaagtcca aatgactaat gcaggtgcgc 
acaggcctcg gcataatata aaacgcctga 
acgagtcggt gagctctccg cttggtctac 
gcgggagatc ttccttcacc cctcgtcagg 
aaccccgctc gggcggaatc gggaccgttc 
acttcaaccc cttctccgga tctcctgggc 
acgcgattag cgagtcagtg gacggctacg 
tcggctgcga catctagacc actgccgccg 
gttcatctac ttcgaactcc ccaaggatca 
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273.01 ccctcaaggt ccggcccacg gagtgcggat tactatcgaa ggcaaaatac actctcgcct 
27361 gcaacgaatt ttctcccagc ggcccgtgct gatcgagcga gaccagggaa acaccacggt 
27421 ttccatctac tgcatttgta atcaccccgg attgcatgaa agcctttgct gtcttatgtg 
27481 tactgagttt aataaaaact gaattaagac tctcctacgg actgccgctt cttcaacccg 
27541 gattttacaa ccagaagaac gaaacttttc ctgtcgtcca ggactctgtt aacttcacct 
27601 ttcctactca caaactagaa gctcaacgac tacaccgctt ttccagaagc attttcccta 
27661 ctaatactac tttcaaaacc ggaggtgagc tccaaggtct tcctacagaa aacccttggg 
27721 tggaagcggg ccttgtagtg ctaggaattc ttgcgggtgg gcttgtgatt attctttgct 
27781 acctatacac accttgcttc actttcctag tggtgttgtg gtattggttt aaaaaatggg 
27841 gcccatacta gtcttgcttg ttttactttc gcttttggaa ccgggttctg ccaattacga 
27901 tccatgtcta gacttcgacc cagaaaactg cacacttact tttgcacccg acacaagccg 
27961 catctgtgga gttcttatta agtgcggatg ggaatgcagg tccgttgaaa ttacacacaa 
28021 taacaaaacc tggaacaata ccttatccac cacatgggag ccaggagttc ccgagtggta 
28081 cactgtctct gtccgaggtc ctgacggttc catccgcatt agtaacaaca ctttcatttt 
28141 ttctgaaatg tgcgatctgg ccatgttcat gagcaaacag tattctctat ggcctcctag 
28201 caaggacaac atcgtaacgt tctccattgc ttattgcttg tgcgcttgcc ttcttactgc 
28261 tttactgtgc gtatgcatac acctgcttgt aaccactcgc atcaaaaacg ccaataacaa 
28321 agaaaaaatg ccttaacctc tttctgttta cagacatggc ttctcttaca tctctcptat 
28381 ttgtcagcat tgtcactgcc gctcacggac aaacagtcgt ctctatccct ctaggacata 
28441 attacactct cataggaccc ccaatcactt cagaggtcat ctggaccaaa ctgggaagcg 
28501 ttgattactt tgatataatc tgcaacaaaa caaaaccaat aatagtaact tgcaacatac 
28561 aaaatcttac attgattaat gttagcaaag tttacagcgg ttactattat ggttatgaca 
28621 gatacagtag tcaatataga aattacttgg ttcgtgttac ccagttaaaa accacgaaaa 
28681 tgccaaatat ggcaaagatt cgatccgatg acaattctct agaaactttt acatctccca 
28741 ccacacccga cgaaaaaaac atcccagatt caatgattgc aattgttgca gcggtggcag 
28801 tggtgatggc actaataata atatgcatgc ttttatatgc ttgtcgctac aaaaagtttc 
28861 atcctaaaaa acaag'atctc ctactaaggc ttaacattta atttcttttt atacagccat 
28921 ggtttccact accacattcc ttatgcttac tagtcttgca actctgactt ctgctcgctc 
28981 acacctcact gtaactatag gctcaaactg cacactaaaa ggacctcaag gtggtcatgt 
29041 cttttggtgg agaatatatg acaatggatg gtttacaaaa ccatgtgacc aacctggtag 
29101 atttttctgc aacggcagag acctaaccat tatcaacgtg acagcaaatg acaaaggctt 
29161 ctattatgga accgactata aaagtagttt agattataac attattgtac tgccatctac 
29221 cactccagca ccccgcacsia ctactttctc tagcagcagt gtcgctaaca atacaatttc 
29281 caatccaacc tttgccgcgc ttttaaaacg cactgtgaat aattctacaa cttcacatac 
29341 aacaatttcc acttcaacaa tcagcattat cgctgcagtg acaattggaa tatctattct 
29401 tgtttttacc ataacctact acgcctgctg ctatagaaaa gacaaacata aaggtgatcc 
29461 attacttaga tttgatattt aatttgttct tttttttttt atttacagta tggtgaacac 
29521 caatcatggt acctagaaat ttcttcttca ccatactcat ttgtgcattt aatgtttgcg 
29*581 ctactttcac agcagtagcc acagcaaccc cagactgtat aggagcattt gcttcctatg 
29641 cactttttgc ttttgttact tgcatctgcg tatgtagcat agtctgcctg gttattaatt 
29701 ttttccaact tctagactgg atccttgtgc gaattgccta cctgcgccac catcccgaat 
29761 accgcaacca aaatatcgcg gcacttctta gactcatcta aaaccatgca ggctatacta 
29821 ccaatatttt tgcttctatt gcttccctac gctgtctcaa ccccagctgc ctatagtact 
29881 ccaccagaac accttagaaa atgcaaattc caacaaccgt ggtcatttct tgcttgctat 
29941 cgagaaaaat cagaaattcc cccaaattta ataatgattg ctggaataat taatataatc 
30001 tgttgcacca taatttcatt tttgatatac cccctatttg attttggctg gaatgctccc 
30061 aatgcacatg atcatccaca agacccagag gaacacattc ccctacaaiaa catgcaacat 
30121 ccaatagcgc taatagatta cgaaagtgaa ccacaacccc cactactccc tgctattagt 
30181 tacttcaacc taaccggcgg agatgactga aacactcacc acctccaatt ccgccgagga 
30241 tctgctcgat atggacggcc gcgtctcaga acagcgactt gcccaactac gcatccgcca 
30301 gcagcaggaa cgcgcggcca aagagctcag agatgtcatc caaattcacc aatgcaaaaa 
30361 aggcatattc tgtttggtaa aacaagccaa gatatcctac gagatcaccg ctactgacca 
30421 tcgcctctct tacgaacttg gcccccaacg acaaaaattt acctgcatgg tgggaatcaa 
30481 ccccatagtt atcacccagc aaagtggaga tactaagggt tgcattcact gctcctgcga 
30541 ttccatcgag tgcacctaca ccctgctgaa gaccctatgc ggcctaagag acctgctacc 
30601 aatgaattaa aaaatgatta ataaaaaatc acttacttga aatcagcaat aaggtctctg 
30661 ttgaaatttt ctcccagcag cacctcactt ccctcttccc aactctggta ttctaaaccc 
30721 cgttcagcgg catactttct ccatacttta aaggggatgt caaattttag ctcctctcct 
30781 gtacccacaa tcttcatgtc tttcttccca gatgaccaag agagtccggc tcagtgactc 
30841 cttcaaccct gtctacccct atgaagatga aagcacctcc caacacccct ttataaaccc 
30901 agggtttatt tccccaaatg gcttcacaca aagcccagac ggagttctta ctttaaaatg 
30961 tttaacccca ctaacaacca caggcggatc tctacagcta aaagtgggag ggggacttac 
31021 agtggatgac actgatggta ccttacaaga aaacatacgt gctacagcac ccattactaa 
31081 aaataatcac tctgtagaac tatccattgg aaatggatta gaaactcaaa acaataaact 
31141 atgtgccaaa ttgggaaatg ggttaaaatt taacaacggt gacatttgta taaaggatag 
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31201 tattaacacc ttatggactg gaataaaccc tccacctaac tgtcaaattg tggaaaacac 
31261 taatacaaat gatggcaaac ttactttagt attagtaaaa aacggagggc ttgttaatgg 
31321 ctacgtgtct ctagttggtg tatcagacac tgtgaaccaa atgttcacac aaaagacagc 
31381 aaacatccaa ttaagattat attttgactc ttctggaaat ctattaactg atgaatcaga 
31441 cttaaaaatt ccacttaaaa ataaatcttc tacagcgacc agtgaaactg tagccagcag 
31501 caaagccttt atgccaagta ctacagctta tcccttcaac accactacta gggatagtga 
31561 aaactacatt catggaatat gttactacat gactagttat gatagaagtc tatttccctt 
31621 gaacatttct ataatgctaa acagccgtat gatttcttcc aatgttgcct atgccataca 
31681 atttgaatgg aatctaaatg caagtgaatc tccagaaagc aacatagcta cgctgaccac 
31741 atcccccttt ttcttttctt acattacaga agacgacaac taaaataaag tttaagtgtt 
31801 tttatttaaa atcacaaaat tcgagtagtt attttgcctc caccttccca tttgacagaa 
31861 tacaccaatc tctccccacg cacagcttta aacatttgga taccattaga gatagacatt 
31921 gttttagatt ccacattcca aacagtttca gagcgagcca atctggggtc agtgatagat 
31981 aaaaatccat cgcgatagtc ttttaaagcg ctttcacagt ccaactgctg cggatgcgaa 
32041 tccggagtct ggatcacggt catctggaag aagaacgatg ggaatcataa tccgaaaacg 
32101 gtatcggacg attgtgtctc atcaaaccca caagcagccg ctgtctgcgt cgctccgtgc 
32161 aactgctgtt tatgggatca gggtccacag tgtcctgaag catgatttta atagccctta 
32221 acatcaactt tctggtgcga tgcgcgcagc aacgcattct gatttcactc aaatctttgc 
32281 agtaggtaca acacattatt acaatattgt ttaataaacc ataattaaaa gcgctccagc 
32341 caaaactcat atctgatata atcgcccctg catgaccatc ataccaaagt ttaatataaa 
32401 ttaaatgacg ttccctcaaa aacacactac ccacatacat gatctctttt ggcatgtgca 
32461 tattaacaat. ctgtctgtac catggacaac gttggttaat catgcaaccc aatataacct 
32521 tccggaacca cactgccaac accgctcccc cagccatgca ttgaagtgaa ccctgctgat 
32581 tacaatgaca atgaagaacc caattctctc gaccgtgaat cacttgagaa tgaaaaatat 
32641 ctatagtggc acaacataga cataaatgca tgcatcttct cataattttt aactcctcag 
32701 gatttagaaa catatcccag ggaataggaa gctcttgcag aacagtaaag ctggcagaac 
32761 aaggaagacc acgaacacaa cttacactat gcatagtcat agtatcacaa tctggcaaca 
32821 gcgggtggtc ttcagtcata gaagctcggg tttcattttc ctcacaacgt ggtaactggg 
32881 ctctggtgta agggtgatgt ctggcgcatg atgtcgagcg tgcgcgcaac cttgtcataa 
32941 tggagttgct tcctgacatt ctcgtatttt gtatagcaaa acgcggccct ggcagaacac 
33001 actcttcttc gccttctatc ctgccgctta gcgtgttccg tgtgatagtt caagtacagc 
33061 cacactctta agttggtcaa aagaatgctg gcttcagttg taatcaaaac tccatcgcat 
33121 ctaattgttc tgaggaaatc atccacggta gcatatgcaa atcccaacca agcaatgcaa 
33181 ctggattgcg tttcaagcag gagaggagag ggaagagacg gaagaaccat gttaattttt 
33241 attccaeiacg atctcgcagt acttcaaatt gtagatcgcg cagatggcat ctctcgcccc 
33301 cactgtgttg gtgaaaaagc acagctaaat caaaagaaat gcgattttca aggtgctcaa 
33361 cggtggcttc caacaaagcc tccacgcgca catccaagaa caaaagaata ccaaaagaag 
33421 gagcattttc taactcctca atcatcatat tacattcctg caccattccc agataatttt 
33481 cagctttcca gccttgaatt attcgtgtca gttcttgtgg taaatccaat ccacacatta 
33541 caaacaggtc ccggagggcg ccctccacca ccattcttaa acacaccctc ataatgacaa 
33601 aatatcttgc tcctgtgtca cctgtagcga attgagaatg gcaacatcaa ttgacatgcc 
33661 cttggctcta agttcttctt taagttctag ttgtaaaaac tctctcatat tatcaccaaa 
33721 ctgcttagcc agaagccccc cgggaacaag agcaggggac gctacagtgc agtacaagcg 
33781 cagacctccc caattggctc cagcaaaaac aagattggaa taagcatatt gggaaccgcc 
33841 agtaatatca tcgaagttgc tggaaatata atcaggcaga gtttcttgta aaaattgaat 
33901 aaaagaaaaa tttgccaaaa aaacattcaa aacctctggg atgcaaatgc aataggttac 
33961 cgcgctgcgc tccaacattg ttagttttga attagtctgc aaaaataaaa aaaaaaacaa 
34021 gcgtcatatc atagtagcct gacgaacagg tggataaatc agtctttcca tcacaagaca 
34081 agccacaggg tctccagctc gaccctcgta aaacctgtca tggtgattaa acaacagcac 
34141 cgaaagttcc tcgcggtgac cagcatgaat aattcttgat gaagcataca atccagacat 
34201 gttagcatca gttaacgaga aaaaacagcc aacatagcct ttgggtataa ttatgcttaa 
34261 tcgtaagtat agcaaagcca cccctcgcgg atacaaagta aaaggcacag gagaataaaa 
34521 aatataatta tttctctgct gctgttcagg caacgtcgcc cccggtccct ctaaatacac 
34381 atacaaagcc tcatcagcca tggcttacca gacaaagtac agcgggcacg cacaagctct 
34441 aaagtcactc tccaacctct ccacaatata tatacacaag ccctaaactg acgtaatggg 
34501 agtaaagtgt aaaaaatccc gccaeiaccca acacacaccc cgaaactgcg tcaccaggga 
34561 aaagtacagt ttcacttccg caatcccaac aagcgtcact tcctctttct cacggtacgt 
34621 cacatcccat taacttgcaa cgtcattttc ccacggccgc gccgccccgt ttagccgtta 
34681 accccacagc caatcaccac acaccccaca atttttaaaa tcacctcatt tacatattgg 
34741 caccattcca tctataaggt atattattga tgatg 
SEQ ID NO: 12 
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